CHAPTER 1


Introduction 


In a rather short period of time, game theory has become one of the 
most powerful analytical tools in the study of politics. From its ear- 
liest applications in electoral and legislative behavior, game theoretic 
models have proliferated in such diverse areas as international security,
ethnic cooperation, and democratization. Indeed, all of the major
fields in political science have been the recipients of important contri- 
butions from political game theoretic models. Rarely does an issue of 
the American Political Science Review, the American Journal of Po- 
litical Science, or International Organization appear without at least 
one article formulating a new game theoretic application to politics or 
providing an empirical test of an existing one. 

Nevertheless, applications of game theory in political science have
not developed as fast as they have in economics. One of the consequences
of this uneven development is that most political scientists who wish to learn
game theory are forced to rely on textbooks written by and for econo- 
mists. While there are many excellent economic game theory texts, 
their treatments of the subject are often not well-suited to the needs 
of many political scientists. First and perhaps most importantly, the 
applications and topics are generally those of interest to economists. 
For example, it is not always obvious to novice political scientists what 
duopoly or auction theory tells us about political phenomena. Alter- 
natively, there are topics such as voting theory that are indispensable 
to political game theorists which receive scant coverage in economics 
texts. Finally, many economics treatments presume some level of ex- 
posure to ideas in classical price theory. Thus, the entry barriers 
to political scientists are not only the math, but also a knowledge of 
demand curves, marginal rates of substitution, and the like. 

Certainly, there have been a few texts by and for political scientists 
such as those by Ordeshook and Morrow. However, we feel that each
is dated both in terms of the applications and in terms of the needs
of modern political science. Ordeshook remains an outstanding treat- 
ment of social choice and spatial theory, yet it was written well before 
the emergence of non-cooperative theory as the dominant paradigm in
political game theory. Morrow does provide an accessible introduction
to the tools of non-cooperative game theory. However, the analytical 
level of his presentation falls somewhat below the contemporary needs 
of students of political game theory. It has also been a decade since its 
publication — a decade in which there have been hundreds of important 
articles and books deploying the tools of game theory. We feel that 
there is a need to introduce today’s students to today’s literature. 

So we kept several goals in mind while writing this book. First, we 
wanted to write a textbook on political game theory instead of a book 
on abstract or economic game theory. We wanted to focus on appli- 
cations of interest to political scientists. We wanted to present topics 
that are unique to political analysis. Secondly, in writing a book for 
political scientists, we wanted to be cognizant of the diversity of back- 
grounds and interests in political science. We recognize that most 
doctoral students in the field enter graduate school with limited math- 
ematical and modelling backgrounds. However, we felt that it would 
not serve even those students to ignore the role of mathematical rigor 
and the importance of theoretical concepts in contemporary political 
models. Thus, for those students we have included a detailed mathe- 
matical appendix covering necessary tools ranging from set theory to 
basic optimization. At the other end of the spectrum are students who 
come to graduate study in political science with strong backgrounds in 
mathematics and economics. We wanted to write a book that would 
be useful to that audience as well and have chosen to provide in-depth
coverage of some more difficult and subtle concepts that are of the first 
importance to political game theory. As a result, we have included a 
number of “starred” advanced sections which provide a bit more detail 
about the analytical and mathematical structure of the models that we 
encounter. All of these can be safely skipped on a first reading by
those not quite ready for the more technical material.


1. Organization of the Book 


In terms of the organization, our book departs from standard treatments
by including a number of topics that are either specifically relevant
for political science or designed for remediation in areas in which
students of political science often have limited background. Chapter 2
is a reasonably self-contained exposition of classical choice theory under
certainty. In this chapter, we lay out the basic ideas of preferences and 
their relation to utility theory. We prove a few key results, but oth- 
erwise focus on providing the intuition and language of rational choice 
theory. We also provide a section on spatial or Euclidean preferences
which play a key role in voting theory as well as applications in electoral
and legislative politics. 

In chapter 3, we consider how agents make choices under uncer- 
tainty. We develop the standard von Neumann-Morgenstern expected 
utility model, but also consider some of the most serious criticisms 
levied against it. In addition to the standard treatment of preferences 
for risk, we discuss the special implications of risk when actors have 
spatial preferences. 

Chapter 4 is a cursory review of social choice theory. The chapter is 
not intended to be a replacement for full-length texts such as those by 
Peter Ordeshook and David Austen-Smith and Jeff Banks, but primar- 
ily as a reference for those ideas and concepts in social choice theory 
that have become integral parts of formal political science, such as the 
impossibility theorem and the non-existence of the majority core. 

Chapter 5 begins our treatment of non-cooperative game theory 
which lies at the heart of contemporary formal political theory. We 
examine normal form games with complete information and present the 
fundamental solution concept of Nash equilibrium. The theoretical 
development is fairly standard but we include a number of important 
political applications. We review the standard Downsian model of 
electoral competition as well as some more recent extensions by Donald 
Wittman and Randy Calvert. We also present several models of private 
contributions to public goods based on the work of Thomas Palfrey and 
Howard Rosenthal. In chapter 6, we extend the normal form model
to settings where agents are uncertain about the payoffs associated with different
strategy combinations. After presenting the relevant solution concept, 
Bayesian Nash equilibrium, we look at incomplete information versions 
of many of the models reviewed in chapter 5. This allows the reader to 
get a good sense of the strategic implications of uncertainty. We present 
the Palfrey-Rosenthal model with complete and incomplete information 
to give the reader a sense of the implications (and often the lack
thereof) of different informational assumptions.

Chapter 7 considers dynamic, multi-stage games of complete information
and develops the notion of subgame perfection. Here we
focus on a number of applications drawn from legislative politics, demo- 
cratic transitions, coalition formation, and international crisis bargain- 
ing. In chapter 8, we consider dynamic games where some players are 
imperfectly informed about the payoffs to different strategies. After 
developing the solution concepts relevant for such models, we explore 
a number of applications drawn from legislative politics, campaign fi- 
nance, and international bargaining. Much of this chapter is focused 
on the important and broadly applied class of signaling games.

Chapter 9 reviews the theory of repeated games and their application
in political science. The role of discounting and the structure of folk
theorems in repeated games are the primary focus of the chapter.

In chapter 10, we consider various applications of bargaining theory.
We begin with the canonical model of Rubinstein and its majority rule 
version developed by Baron and Ferejohn. We then consider several 
examples of bargaining with incomplete information. 

In chapter 11, we present the mechanism design approach to model- 
ing institutions. Here the focus is on the selection of games that induce 
equilibrium behavior to meet certain ends. We discuss a number of 
recent applications to electoral politics and organizational design. We 
build on the work of chapter 8, drawing connections between signaling 
games and mechanism design. 

Finally, to keep the book as self-contained as possible, chapter 12
provides a review of all of the mathematics that is used in the book.
Topics which are integral to the development of key theoretical results 
or tools for analyzing applications are drawn from the fields of set the- 
ory, real analysis, linear algebra, calculus, optimization and probability 
theory. Indeed this chapter may serve as a basis for review, self-study 
or a formal course in mathematics for students interested in working 
at the frontier of political game theory. 


CHAPTER 2 


The Theory of Choice 


The starting point for almost all of political game theory is the idea 
that individuals rationally pursue goals subject to constraints imposed 
by physical resources as well as the behavior of other actors. Such an 
assumption is often controversial. Indeed one of the most contentious 
debates in political science is the role of rationality and intentionality 
as a predictor of political behavior. However, we will defer debates 
between homo economicus and homo sociologicus and jump right into 
the classical model of rational choice. 

For almost all of our purposes, it is sufficient to define rationality 
in terms of a few simple ideas: 


(1) Confronted with any two options, which we might denote x 
and y, an individual can determine whether he does not prefer 
option x to option y or whether he does not prefer y to x, or 
both. When preferences satisfy this property, we say they are 
complete. 

(2) Confronted with three options x, y, and z, if an individual does 
not prefer y to x and does not prefer z to y then it must be the 
case that she does not prefer z to x. Preferences satisfying 
this property are transitive. 


Roughly speaking, we define rational behavior as behavior consistent
with complete and transitive preferences. Sometimes behavior
dictated solely by properties 1 and 2 is called “thin” rationality. This 
is because properties 1 and 2 are not predicated in any way on as- 
sumptions about the substantive content of human desires. Thus, 
thin rationality contrasts with “thicker” notions of rationality where 
specific goals such as wealth, status, or fame are postulated. The thin 
characterization of rationality is consistent with a very large number 
of these substantive goals. In principle, thinly rational agents could
be motivated by any number of factors including ideology, normative 
values, or even religion. As long as these belief systems produce com- 
plete and transitive orderings over personal and social outcomes, we can 
model the behavior they produce using the classical model of choice.

While it may be appealing to deal with models that are independent
of assumptions about specific goals, it will often be desirable to
make stronger assumptions about preferences. For example, we might
assume that interest groups wish to maximize the wealth of their members
or that politicians wish to maximize their reelection chances. In
subsequent chapters, we will explore models that make these types of
assumptions about agent preferences. But rational models may be just 
as useful for models of activists who wish to minimize environmental 
degradation or the number of abortions for principled, non-material 
reasons. 

In the following sections, we develop the classical theory of choice 
under certainty. By certainty, we mean simply that the agent has suf- 
ficient information about the choice environment that she can perfectly 
predict the consequences of each of her actions. Thus, certainty means 
only that there is no analytical difference between assuming that polit- 
ical actors choose actions based on the desired outcomes which result 
from those actions, or that they choose those outcomes directly. In 
later chapters, we will examine choice under uncertainty — the actor’s 
lack of knowledge of some feature of the choice environment leads her 
to choose actions which have uncertain consequences. 


1. Finite Sets of Actions and Outcomes 


We begin by considering the simple case of an agent who faces a
finite number of actions from which to choose. We denote these
choices as a set $A = \{a_1, \ldots, a_k\}$. For example, a leader involved in
an international crisis might face the following set of alternatives A =
{send in the troops, negotiate, do nothing} whereas an American voter
might choose among A = {vote Democrat, vote Republican, abstain}.

As mentioned above, in this chapter we assume that agents have
complete information; that is, they are sufficiently knowledgeable about
the context of their choices that they can perfectly predict the
consequences of each action. To capture this idea formally, we define
outcome sets $X = \{x_1, \ldots, x_n\}$. Following one of our examples, these
might be X = {win major concessions and lose troops, win minor
concessions, status quo}. The assumption of certainty then implies that
each action $a \in A$ maps directly on to one and only one $x \in X$.
Formally, we assume that there exists a function $x : A \to X$ that maps
each action into a specific outcome. We also assume that all of the
outcomes in X are feasible, that is, each is the consequence of at least
one action. Formally, $x_i$ is feasible if there exists an $a \in A$ such that
$x(a) = x_i$. With the assumptions of certainty and feasibility, it makes
little difference whether we speak of an agent’s preferences over actions
or his preferences over outcomes. Thus, we concentrate on the agent’s 
preferences over outcomes and are uninterested in the action that she 
must choose to attain the desired outcome. In chapter 3, the assump- 
tion of uncertainty or incomplete information makes the distinctions 
between actions and outcomes relevant. 

We now turn to the concept of preferences and the types of re- 
strictions that our two simple notions place on what outcome rational 
individuals may choose. Formally, preferences are modelled as a binary
relation R which represents “weak preference.” The notation
$x_i R x_j$ means that outcome $x_j$ is not preferred to outcome $x_i$, or that $x_i$ is
“weakly” preferred to $x_j$.¹ To help cement ideas, note that R is similar
to the binary relation $\geq$ (greater than or equal) which operates on real
numbers.

Given the weak preference relation R, we define two other important 
binary relations: strict preference and indifference. 


DEFINITION 2.1. Given any $x, y \in X$, we say $x P y$ if and only if
$x R y$ and not $y R x$. We say $x I y$ if and only if $x R y$ and $y R x$.


Accordingly, P denotes strict preference and I denotes indifference.
Returning to the example of $\geq$ on the real numbers, the strict preference relation
derived from $\geq$ is equivalent to the relation $>$ and the indifference
relation is equivalent to the relation $=$.

While preferences, in the form of binary relations, are a useful starting
point, it is choices that we are ultimately interested in. Given a
set of preferences, we could hardly call an agent’s behavior rational
unless she selected an outcome that she valued at least as much as any
other. Consequently, we expect a rational agent to choose an $x^* \in X$
for which $x^* R y$ for every $y \in X$. However, without adding a little bit
more structure on to her preferences, there is no guarantee that such a
maximal outcome exists. Thus, we now turn to the conditions on X
and R that ensure such a “best” choice is meaningful and well-defined.
We start with the following formal definition.


DEFINITION 2.2. Given a set X and weak preference relation R on
X, the maximal set $M(R, X) \subseteq X$ is defined as follows:
$M(R, X) = \{x \in X : x R y \ \forall y \in X\}$.


Thus, a fundamental tenet of rationality should be that agents
choose outcomes from the maximal set. Of course, this requirement
only makes sense if the maximal set is not empty, i.e. $M(R, X) \neq \emptyset$.
Thus, we are most interested in the properties of preferences that
guarantee that $M(R, X)$ has at least one element.


¹ Formally, R is a subset of $X \times X$ such that if $(x, y) \in R$ then $x R y$.
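
As a small illustration of the maximal set, the following Python sketch
computes M(R, X) by brute force for a finite outcome set. The function name
`maximal_set`, the numeric ranking of the crisis outcomes, and the cyclic
relation are all hypothetical choices made for this example; they are not
taken from the text.

```python
def maximal_set(outcomes, weakly_prefers):
    """Return M(R, X): every x in X with x R y for all y in X."""
    return {x for x in outcomes
            if all(weakly_prefers(x, y) for y in outcomes)}

# Assumed ranking for illustration: a higher number is a more preferred outcome.
rank = {"major concessions": 2, "minor concessions": 1, "status quo": 1}
X = set(rank)

def R(x, y):
    """x R y when x is ranked at least as high as y (complete and transitive)."""
    return rank[x] >= rank[y]

best = maximal_set(X, R)

# A relation with a strict cycle x P y, y P z, z P x has no maximal outcome.
beats = {("x", "y"), ("y", "z"), ("z", "x")}

def cyclic(a, b):
    """Reflexive and complete, but cyclic in its strict part."""
    return a == b or (a, b) in beats

empty = maximal_set({"x", "y", "z"}, cyclic)
```

Under these assumptions `best` contains only the top-ranked outcome, while
the cyclic relation leaves the maximal set empty, which is the problem the
conditions below are meant to rule out.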

The most obvious problem that might lead to an empty maximal
set is that R is silent between a pair of outcomes, say x and y. If
neither $x R y$ nor $y R x$, then it is not clear what a “rational choice” would
be. Two conditions ensure that all elements of X are ordered.


DEFINITION 2.3. A binary relation R on X is

(i) complete if for all $x, y \in X$ with $x \neq y$, either $x R y$ or $y R x$ or
both.

(ii) reflexive if for all $x \in X$, $x R x$.


Completeness simply means that the agent can compare any two 
outcomes. This is probably not a terribly controversial assumption 
(though we all know people who can’t seem to make their minds up). 
Reflexivity is a technical condition and some authors choose to define 
completeness in a slightly different manner that also captures what we 
call reflexivity. 

While completeness and reflexivity get us closer to a “rational” preference
relation, they are not sufficient. We need to rule out cycles
like $x P y$, $y P z$, and $z P x$. The problem with such preferences is that
there is no reasonable choice: why choose y when you can choose x,
why choose x when you can choose z, and why choose z when you can
choose y? Each of the following conditions on preferences resolves this
problem.


DEFINITION 2.4. A binary relation R on X is

(1) Transitive if for all $x, y, z \in X$, if $x R y$ and $y R z$ then $x R z$.

(2) Quasi-transitive if for all $x, y, z \in X$, if $x P y$ and $y P z$ then $x P z$.

(3) Acyclic if for all $\{x, y, z, \ldots, a, b\} \subseteq X$, if $x P y$ and $y P z$ ... and
$a P b$ then $x R b$.


Note the subtle differences among these definitions. Transitivity 
and quasi-transitivity may seem innocuous but they are reasonably 
strong assumptions which might be violated even by very reasonable 
behavior. For example, suppose X is a set of 1000 different bottles
of beer. Beer $b_1$ has had one drop of beer replaced with one drop of
plain water while $b_2$ has had two drops replaced, and so on to $b_{1000}$.
Unless you are a master brewer, $b_1 I b_2$, $b_2 I b_3$, ..., and $b_{999} I b_{1000}$.
Since $x I y$ implies $y R x$ (by the definition of I), this would imply that
$b_{1000} R b_{999}$, ..., $b_2 R b_1$, and if the relation is transitive we are left with
$b_{1000} R b_1$. But clearly, $b_1 P b_{1000}$.² Note, however, that the assumption of
acyclicity does not suffer from this problem and is typically sufficient
for our purposes. However, despite the problems associated with
transitivity, we will maintain it as an assumption (rather than acyclicity)
to simplify many of the results below.


² This is approximately the difference between Guinness and Coors Light.
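
The three conditions can be compared on a scaled-down version of the beer
example. The sketch below is only illustrative: it uses 10 bottles rather
than 1000 and assumes, hypothetically, that the drinker notices a difference
only when two bottles differ by at least two drops. The helper names
(`strict`, `is_transitive`, `is_quasi_transitive`, `is_acyclic`, `R_beer`)
are introduced here and do not come from the text.

```python
from itertools import product

def strict(R):
    """Strict preference P derived from R: x P y iff x R y and not y R x."""
    return lambda x, y: R(x, y) and not R(y, x)

def is_transitive(X, R):
    """x R y and y R z imply x R z, checked over all triples."""
    return all(R(x, z) for x, y, z in product(X, repeat=3)
               if R(x, y) and R(y, z))

def is_quasi_transitive(X, R):
    """Transitivity of the strict part P."""
    return is_transitive(X, strict(R))

def is_acyclic(X, R):
    """Any chain x P y, y P z, ..., a P b must satisfy x R b."""
    X, P = list(X), strict(R)
    # reach[x] collects every outcome reachable from x by a chain of P.
    reach = {x: {y for y in X if P(x, y)} for x in X}
    changed = True
    while changed:
        changed = False
        for x in X:
            new = set().union(*(reach[y] for y in reach[x])) - reach[x]
            if new:
                reach[x] |= new
                changed = True
    return all(R(x, b) for x in X for b in reach[x])

beers = range(1, 11)  # bottle i has i drops of water; fewer drops is better

def R_beer(i, j):
    """Assumed perception: b_i R b_j unless b_i has 2+ more drops than b_j."""
    return i <= j + 1

transitive = is_transitive(beers, R_beer)
quasi_transitive = is_quasi_transitive(beers, R_beer)
acyclic = is_acyclic(beers, R_beer)
```

Under this assumed perception threshold, adjacent bottles are indifferent,
so transitivity fails, yet the relation remains quasi-transitive and acyclic.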


DEFINITION 2.5. Given a set X a weak ordering is a binary relation 
that is complete, reflexive and transitive. 


It is not difficult to see that transitivity rules out cycles of strict
preference such as $x P y$, $y P z$, and $z P x$, the problem considered above.
Note that our recurring example
of $\geq$ satisfies all of the conditions for a weak ordering. We can now
state our first result.


THEOREM 2.1. If X is finite and R is a weak ordering then
$M(R, X) \neq \emptyset$.


Thus, we can guarantee that there is a best choice so long as we are 
willing to assume that the choice set is finite and that R is complete, 
reflexive, and transitive. The intuition behind this theorem is quite 
straightforward. Consider an outcome set X with three elements, say
x, y, and z. If $M(R, X) = \emptyset$, then by definition and completeness some
outcome must be strictly preferred to z, some outcome must be strictly
preferred to x, and some outcome must be strictly preferred to y. With only
three outcomes, these strict preferences must form a cycle, which violates
transitivity. Thus the maximal set cannot be empty; in the extreme case
$x I y I z$, for instance, $M(R, X) = X$.
We can extend this logic to any size X.


PROOF. Assume that X is finite and R is complete, reflexive, and
transitive. We establish the result by induction on the number of
elements in X.

Step 1: If X has 1 element (i.e. $X = \{x\}$), then by reflexivity
$x R x$ and thus $M(R, X) = \{x\}$.

Step 2: We show that if, for any $X'$ with n elements, $R'$ being a
weak ordering on $X'$ implies that $M(R', X') \neq \emptyset$, then for any X with
n + 1 elements, when R is a weak ordering on X, $M(R, X) \neq \emptyset$.

-Proof of step 2: assume that for any $X'$ with n elements, $R'$
a weak ordering on $X'$ implies that $M(R', X') \neq \emptyset$. Now consider
a set X with n + 1 elements. For arbitrary $x \in X$ it is true that
$X = X' \cup \{x\}$ with $X'$ a set having n elements; let $R'$ denote R
restricted to $X'$, which is again a weak ordering. By assumption
$M(R', X') \neq \emptyset$. So for an arbitrary $y \in M(R', X')$ either $y R x$ or $x R y$
or both by completeness.

-If $y R x$ then since $y \in M(R', X')$ we have $y R z$ for all $z \in X' \cup \{x\}$
and thus $y \in M(R, X)$ and the step 2 result is established.

-If it is not the case that $y R x$ then we have $x R y$. Since
$y \in M(R', X')$ we have $y R z$ for any $z \in X'$. Thus for any $z \in X'$ we have
$x R y$ and $y R z$. Since R is transitive this implies that we have $x R z$ for
any $z \in X'$. This and $x R x$ (by reflexivity) imply that for any $w \in X$ we
have $x R w$ and thus $x \in M(R, X)$ and the step 2 result is established.

Step 3: By steps 1 and 2, for any finite X, if R is a weak
order on X then $M(R, X) \neq \emptyset$. □
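
The induction in the proof is constructive, and a minimal Python sketch of
it may help fix ideas. The function `find_maximal` and the spatial example
(an agent with an assumed ideal point at 4 who weakly prefers outcomes
closer to it) are hypothetical illustrations rather than anything stated
in the text.

```python
def find_maximal(outcomes, R):
    """Mimic the induction: add outcomes one at a time while keeping a
    candidate y that is maximal among the outcomes seen so far."""
    outcomes = list(outcomes)
    if not outcomes:
        raise ValueError("X must be non-empty")
    y = outcomes[0]            # Step 1: the lone element of a singleton is maximal
    for x in outcomes[1:]:     # Step 2: extend X' to X' U {x}
        if not R(y, x):        # completeness then gives x R y, and
            y = x              # transitivity makes x maximal on X' U {x}
    return y

IDEAL = 4                      # assumed ideal point, for illustration only
X = [0, 2, 5, 7, 10]

def R(x, y):
    """x R y when x is at least as close to the ideal point as y."""
    return abs(x - IDEAL) <= abs(y - IDEAL)

choice = find_maximal(X, R)
```

The single pass is only justified when R is a weak ordering; without
transitivity the final candidate need not be weakly preferred to outcomes
that were discarded earlier.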


It turns out that a weak ordering is not needed for $M(R, X)$ to be
non-empty, but the proof is a bit more complicated so we leave it for an exercise.


THEOREM 2.2. Assume X is finite and R is a complete and reflexive
binary relation on X. $M(R, S) \neq \emptyset$ for every non-empty $S \subseteq X$ if and only if
R is acyclic.
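
Theorem 2.2 can be checked exhaustively in the smallest interesting case.
The sketch below enumerates all 27 complete and reflexive relations on a
three-element set and compares the two sides of the equivalence. It is a
sanity check for three outcomes only, not a proof, and every name in it is
introduced here for illustration.

```python
from itertools import combinations, permutations, product

X = ("x", "y", "z")
PAIRS = list(combinations(X, 2))

def relations():
    """Yield each complete, reflexive relation on X as a set of ordered pairs."""
    for choice in product(("first", "second", "both"), repeat=len(PAIRS)):
        rel = {(a, a) for a in X}
        for (a, b), c in zip(PAIRS, choice):
            if c in ("first", "both"):
                rel.add((a, b))
            if c in ("second", "both"):
                rel.add((b, a))
        yield rel

def maximal_set(S, rel):
    """M(R, S) for the relation given as a set of pairs."""
    return {a for a in S if all((a, b) in rel for b in S)}

def every_subset_has_maximal(rel):
    subsets = [s for k in range(1, len(X) + 1) for s in combinations(X, k)]
    return all(maximal_set(s, rel) for s in subsets)

def is_acyclic(rel):
    """With three outcomes it suffices to check chains a P b, b P c."""
    def P(a, b):
        return (a, b) in rel and (b, a) not in rel
    return not any(P(a, b) and P(b, c) and (a, c) not in rel
                   for a, b, c in permutations(X, 3))

all_relations = list(relations())
agree = all(every_subset_has_maximal(r) == is_acyclic(r) for r in all_relations)
n_acyclic = sum(is_acyclic(r) for r in all_relations)
```

In this enumeration the only relations that fail both tests are the two
strict cycles, one for each direction around the three outcomes.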


Even with a finite choice space and no uncertainty the theory of 
choice is fairly rich. Austen-Smith and Banks (1999) is a good first 
source for students interested in going further. Many economists and 
psychologists have been concerned about the assumption of completeness,
and a theory of choice without this condition has been derived.

In the next, more technical, section, we consider rational choice 
when the outcome space is not finite, such as the real line. We derive 
an analog to Theorem 2.1 for non-finite choice sets. While the results
are conceptually similar, additional mathematical structure needs to 
be placed on the choice sets and preferences. 


2. Continuous Outcome Spaces* 


2.1. Non-emptiness of M(R, X). Examination of the argument
for Theorem 2.1 demonstrates that the fact that the choice space was finite
was useful. This allowed us to prove the result by induction for any
number of outcomes. However, when there is an infinite number of
choices, this approach is mathematically inappropriate. Thus, when
agents choose from a choice space that is a continuum (e.g. the set of
real numbers denoted $\mathbb{R}$ or the set $[0,1] = \{x \in \mathbb{R} : x \geq 0 \text{ and } x \leq 1\}$)
more structure on preferences is needed to ensure that $M(R, X) \neq \emptyset$.
Two simple examples demonstrate why things can go awry.


EXAMPLE 2.1. Let $X = (0,1)$ (or let $X = \mathbb{R}^1$) and let R on $\mathbb{R}^1$ be
equivalent to $\geq$ so that $x R y$ if and only if $x \geq y$. The set $M(\geq, X)$ is empty.


To see why $M(\geq, (0,1))$ is empty, note that for every $x \in X$ there
exists some $y \in X$ for which $y > x$. Thus there can be no x such that
$x R y$ for all $y \in X$. In this example the fact that $(0,1)$ has no biggest
element results in the emptiness of the maximal set. Note however that
if X were a closed interval such as $[0,1]$, we wouldn’t have a problem
as $M(\geq, [0,1]) = \{1\}$. This is a strong hint that general results about
the non-emptiness of the maximal set may depend on the choice set
being “closed.”


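EXAMPLE 2.2. Let $X = [0,1]$ and define R on $\mathbb{R}^1$ as follows: $x R y$
if $x, y \leq \frac{1}{2}$ and $x \leq y$, or if $x, y > \frac{1}{2}$ and $x \leq y$, or if $x > \frac{1}{2}$ and $y \leq \frac{1}{2}$.
The set $M(R, X)$ is empty.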


To see this, note that no element of $[0, \frac{1}{2}]$ can be in $M(R, X)$, as
any element of $(\frac{1}{2}, 1]$ is strictly preferred to it. However, elements of $(\frac{1}{2}, 1]$
cannot be part of $M(R, X)$ for reasons identical to those of the previous
example: within this interval smaller outcomes are preferred, and the
interval has no smallest element. Here the problem is not with X, which
is a closed interval as
we required to make the first example work. Instead, the problem is
with R. It jumps at $\frac{1}{2}$. Outcomes slightly less than or equal to
$\frac{1}{2}$ are among the least preferred while those slightly greater are among
the most preferred. It is this “discontinuity” in preferences that leads
to the empty maximal set in the example.
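
Both examples can be made concrete by exhibiting, for any candidate
outcome, a strictly better one. The following Python sketch does this with
exact rational arithmetic. It assumes the reading of Example 2.2 in which
smaller outcomes are preferred within each half of [0,1] and every outcome
above 1/2 beats every outcome at or below 1/2; the function names are
hypothetical.

```python
from fractions import Fraction

HALF = Fraction(1, 2)

def improve_open_interval(x):
    """Example 2.1: for x in (0,1), return a larger point still inside (0,1)."""
    return (x + 1) / 2

def R(x, y):
    """Assumed relation of Example 2.2 on [0,1]."""
    if x <= HALF and y <= HALF:
        return x <= y              # within the lower half, smaller is better
    if x > HALF and y > HALF:
        return x <= y              # within the upper half, smaller is better
    return x > HALF and y <= HALF  # the upper half beats the lower half

def P(x, y):
    """Strict preference derived from R."""
    return R(x, y) and not R(y, x)

def improve_jump(x):
    """Example 2.2: return a point of [0,1] strictly preferred to x."""
    if x <= HALF:
        return Fraction(3, 4)      # any point above 1/2 works
    return (x + HALF) / 2          # move halfway toward 1/2, staying above it

grid = [Fraction(k, 100) for k in range(101)]
no_maximal_on_grid = all(P(improve_jump(x), x) for x in grid)
```

Because every point has a strict improvement inside the choice set, no
point can belong to the maximal set in either example.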

Before turning these examples and intuitions into general axioms,
we need to review a few mathematical concepts.³ We begin with the
assumption that preferences are defined on n-dimensional Euclidean
space, and consider choice from subsets, $X \subseteq \mathbb{R}^n$. A point in such
a space can be written as a vector $x = (x^1, x^2, \ldots, x^n)$ where each
coordinate $x^i$ is a point in $\mathbb{R}^1$.

One of our main concerns is whether the set X is open or closed.
These concepts can be grasped with the simplest example of $\mathbb{R}^1$. In
this case a set $A \subseteq X$ is termed open if for every point $x \in A$ there
is some $\epsilon > 0$ such that for any $y \in X$ satisfying $|x - y| < \epsilon$ it is the
case that $y \in A$. Therefore, a set is open if given any point in the
set, all the points close to it are also in the set. Clearly, the set $(0,1)$
is open. For each point in the set, there are some points higher and
some points lower which are also in the set. Thus, for any point x,
we can choose $\epsilon$ so that $x - \epsilon$ and $x + \epsilon$ are also in the set. We say
that a set is closed if its complement is open. Therefore, since $(0,1)$ is
open, $(-\infty, 0] \cup [1, \infty)$ is closed. Intervals such as $[0,1]$ are also closed.
Some sets may be neither open nor closed, such as $[0,1)$.


³ More precisely, we need a few topological concepts. Students interested in
further study of choice theory would be well served by examining the mathematical
appendix to this book or, better yet, a text on real analysis. An approachable
introductory text is: Gaughan, Edward. 1993. Introduction to Analysis, 4th ed.
Brooks/Cole Publishing Company. A more complete text is: Kolmogorov, A.N.
and S.V. Fomin. 1970. Introductory Real Analysis. Dover.

To generalize these concepts to the n-dimensional Euclidean space,
we begin with a measure of distance called the norm.


$$\|x - y\| = \sqrt{\sum_{i=1}^{n} (x^i - y^i)^2}$$


The quantity $\|x - y\|$ is the distance between points x and y and
generalizes the absolute value used in $\mathbb{R}^1$. Given this definition of distance,
we can generalize the notion of an interval into that of a “ball.”


DEFINITION 2.6. An open ball of radius $\epsilon > 0$ and center $x \in X$
is denoted $B(x, \epsilon) = \{y \in X : \|x - y\| < \epsilon\}$.
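
A short Python sketch may make the distance and open ball concrete. It
assumes the norm is the usual Euclidean distance, and the function names
`distance` and `in_open_ball` are introduced only for this illustration.

```python
import math

def distance(x, y):
    """Euclidean distance ||x - y|| between two points of R^n (as tuples)."""
    return math.sqrt(sum((xi - yi) ** 2 for xi, yi in zip(x, y)))

def in_open_ball(y, center, radius):
    """True when y lies in B(center, radius), i.e. ||center - y|| < radius."""
    return distance(center, y) < radius

origin = (0.0, 0.0)
d = distance(origin, (3.0, 4.0))                     # a 3-4-5 right triangle
inside = in_open_ball((0.5, 0.5), origin, 1.0)
on_boundary = in_open_ball((1.0, 0.0), origin, 1.0)  # strict inequality excludes it
```

The strict inequality is what makes the ball open: points at distance
exactly equal to the radius are excluded.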


Now it is easy to generalize the concept of openness. A set is open
if the set contains an open ball around each point for some $\epsilon > 0$.


DEFINITION 2.7. A set $A \subseteq \mathbb{R}^n$ is open if for every $x \in A$ there is
some $\epsilon > 0$ such that $B(x, \epsilon) \subseteq A$.


Just as before, a set is closed if its complement is open. Thus, closed
sets may contain points on their boundary, that is, points around which
every open ball contains points outside
the set.


DEFINITION 2.8. A set $A \subseteq \mathbb{R}^n$ is closed if its complement
$B = \mathbb{R}^n \setminus A$ is an open set.


Recall our first example. Since $X = (0,1)$ is an open set, for each x in
X there is an open ball around x that is also in X. Since each of
these balls contains points strictly preferred to x, the maximal set is
empty. However, if $X = [0,1]$, any open ball around 1 contains points
outside of X. Since all of the points preferred to 1 lie outside of $[0,1]$,
$M(\geq, [0,1]) = \{1\}$. However, note that closedness itself is not sufficient.
Recall that $(-\infty, 0] \cup [1, \infty)$ is a closed set, but $M(\geq, (-\infty, 0] \cup [1, \infty))$
is empty. The problem of course is that there is no upper bound on
this set, so for any x there is a $y > x$ so that $y P x$. Thus, another
important condition is boundedness.


DEFINITION 2.9. A set $A \subseteq \mathbb{R}^n$ is bounded if there exists a finite
number b such that for every $x \in A$ it is the case that $\|x - \mathbf{0}\| < b$,
where $\mathbf{0}$ is the vector $(0, \ldots, 0)$.


The set $(-\infty, 0] \cup [1, \infty)$ clearly fails this criterion, so we can rule
it out by requiring that choice sets be bounded. It is easy to see that in
Example 2.1, so long as X is closed and bounded, $M(\geq, X)$ is non-empty.
In $\mathbb{R}^n$, we call sets that are both closed and bounded compact.


DEFINITION 2.10. A set $A \subseteq \mathbb{R}^n$ is compact if it is closed and
bounded.


Since all examples and problems in this book will deal
with subsets of Euclidean spaces, we could stop here. However, in 
arbitrary spaces, the equivalence between compactness and closed and 
bounded does not hold. It turns out that the proof of our main result 
is easier if we consider a more general definition of compactness (even 
though we lose some of the intuition of our examples). The more 
general definition of compactness is based on sets known as open covers. 
An open cover for a set A is a collection of open sets whose union 
contains A. 


DEFINITION 2.11. Given a set A, an open covering of A is a collection
of sets $\{O_\theta\}_{\theta \in \Theta}$ where $\Theta$ is an arbitrary index set and $O_\theta$ is open
for every $\theta \in \Theta$ such that $A \subseteq \bigcup_{\theta \in \Theta} O_\theta$ (in other words, if $x \in A$ then
there is some $\theta \in \Theta$ such that $x \in O_\theta$).


Given this definition, we say that A has a finite sub-cover if from 
every open cover we can select just a finite number of the open sets and 
be assured that A is covered by this finite collection. The existence of 
such a sub-covering is equivalent to the compactness of A. 


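DEFINITION 2.12. A set A is compact if for any open covering $\{O_\theta\}_{\theta \in \Theta}$
of A there exists some finite set $B \subseteq \Theta$ such that the finite collection $\{O_\theta\}_{\theta \in B}$
is a covering of A, i.e. $A \subseteq \bigcup_{\theta \in B} O_\theta$.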


Since the previous two definitions are subtle for those not familiar
with analysis, an example is warranted. Consider the space
$\mathbb{R}^1$ and two subsets $[0,1]$ and $(0,1)$. We already know
that $(0,1)$ is not compact, as it is not closed and we have concluded
that in Euclidean space compact sets are closed and bounded. To
demonstrate that $(0,1)$ is not compact using the open covering
definition, consider the following open covering of $(0,1)$. For each
$\theta \in \Theta = \{3,4,5,\ldots\}$, let
$O_\theta = (\frac{1}{\theta}, 1 - \frac{1}{\theta})$. This is a
collection of open intervals centered at $\frac{1}{2}$, and the width
of the intervals approaches 1 as $\theta$ gets big. Is
$\{O_\theta\}_{\theta \in \Theta}$ an open covering of $(0,1)$? Yes,
for any element $x \in (0,1)$ you can pick a $\theta$ big enough so
that $x \in O_\theta$. So we have constructed an open covering of
$(0,1)$. Now our definition of compactness says that if $(0,1)$ is
compact we need to be able to find a finite subset
$B \subset \{3,4,5,\ldots\}$ and show that
$(0,1) \subset \bigcup_{\theta \in B} O_\theta$. But for a finite set
B, there is a finite largest element $\theta^* \in B$.^4 This means
that the value $\frac{1}{\theta^*}$ is strictly larger than 0 and,
since $(0,1)$ contains points arbitrarily close to 0,
$\frac{1}{\theta^*}$ is strictly larger than some element of $(0,1)$.
Accordingly, for any finite collection of subsets in the open covering
(i.e. any selection of B that is finite), we can find an element of
$(0,1)$ that is not contained in any set $O_\theta$ for
$\theta \in B$. Thus we have applied the open-covering definition to
show what we already knew: $(0,1)$ is not compact. The interested
reader should try to proceed in the other direction, showing that
$[0,1]$ is compact according to the open-covering definition.^5

^4 You could actually prove this sentence by noting that $\geq$ is a
weak ordering and applying our result about the non-emptiness of the
maximal set for finite sets.
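The argument above can be checked numerically. The following is a minimal Python sketch, not part of the original text: the function names `O`, `covered`, and `uncovered_point` are illustrative assumptions, and exact fractions are used so that the strict inequalities are not blurred by rounding. For any finite index set B it produces a point of (0,1) that no interval indexed by B contains.

```python
from fractions import Fraction

def O(theta):
    """Endpoints (lo, hi) of the open interval (1/theta, 1 - 1/theta)."""
    return Fraction(1, theta), 1 - Fraction(1, theta)

def covered(x, B):
    """True if x lies in the interval O(theta) for at least one theta in B."""
    return any(lo < x < hi for lo, hi in map(O, B))

def uncovered_point(B):
    """A point of (0,1) missed by every interval indexed by the finite set B."""
    theta_star = max(B)                  # a finite set has a largest element
    return Fraction(1, 2 * theta_star)   # 0 < 1/(2 theta*) < 1/theta*

B = {3, 7, 50}                           # hypothetical finite index set
x = uncovered_point(B)
print(x, covered(x, B))                  # 1/100 False
print(covered(x, range(3, 200)))         # True: a larger index set reaches x
```

Enlarging B only moves the uncovered point closer to 0; it never removes it, which is the failure of compactness described in the text.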

Having elucidated properties that X can satisfy (e.g. compactness),
we turn to the properties that we would like R to satisfy. Not
surprisingly, given example 2, we want R to be “continuous” in a
specific way. To define continuity, we use the concept of the upper
contour set.^6 Given a binary relation R on $\mathbb{R}^n$, the strict
upper contour set of a point $x \in \mathbb{R}^n$ is
$P(x) = \{y \in \mathbb{R}^n : yPx\}$. The strict lower contour set of
point x is the set $P^{-1}(x) = \{y \in \mathbb{R}^n : xPy\}$. So the
upper contour set of x contains the points that are preferred to x and
the lower contour set of x contains the points that x is preferred to.
Similarly, the level set of x is the set of points for which the agent
is indifferent to x, or
$I(x) = \{y \in \mathbb{R}^n : yRx \text{ and } xRy\}$.


DEFINITION 2.13. A binary relation R on $\mathbb{R}^n$ is

(i) upper continuous if for all $x \in \mathbb{R}^n$, $P(x)$ is open
(ii) lower continuous if for all $x \in \mathbb{R}^n$, $P^{-1}(x)$ is open
(iii) continuous if it is both lower and upper continuous.


Consider the implications of these conditions. When preferences
are complete, any point y that is very close to x is either in $P(x)$,
$P^{-1}(x)$, or $I(x)$. When preferences are continuous, $y \in P(x)$
or $y \in P^{-1}(x)$ implies that points sufficiently close to y will
also be in the
respective set. To see how continuity rules out anomalous behavior, 
recall example 2. There $P^{-1}(\tfrac{1}{2}+\varepsilon) = (-\infty, \tfrac{1}{2}] \cup (\tfrac{1}{2}+\varepsilon, 1]$, which is not
an open set. Thus, the jump in preferences exhibited in the second 
example is ruled out when preferences are lower continuous. 

We can now state the sufficient conditions for a non-empty maximal 
set. 


^5 We direct the reader to the famed Heine-Borel theorem in any of the
cited texts on real analysis, which relates the topological
open-covering and Euclidean closed-and-bounded definitions of
compactness for $\mathbb{R}^n$. Gaughan (1993) presents a particularly
detailed proof of the result for $\mathbb{R}^1$.

^6 In political science, the upper contour set is often referred to as
the “preferred to set”. Keith Krehbiel has pointed out to both authors
on numerous occasions that this terminology (along with many others)
contains a redundancy. Thus, he and we implore all readers to use our
preferred “preferred set.”


THEOREM 2.3. If $X \subset \mathbb{R}^n$ is non-empty and compact, and
R on $\mathbb{R}^n$ is complete, reflexive, transitive and lower
continuous, then $M(R,X) \neq \emptyset$.

The proof of this result is more technical than most other sections
of this book. The result also holds for arbitrary topological spaces.
This allows us to apply it to choice problems in which x is an infinite
sequence of outcomes, a function, or a probability distribution.


PROOF. Assume that X is non-empty and compact, and R on X
is complete, reflexive, transitive and lower continuous. To establish
a contradiction, assume that $M(R,X) = \emptyset$. Thus, every point in
X is contained in $P^{-1}(a)$ for some $a \in X$. Since R is lower
continuous every such $P^{-1}(a)$ is open. This means that
$\{P^{-1}(a)\}_{a \in X}$ is an open covering of X. Since X is compact,
there exists a finite set of points $B \subset X$ for which the
collection $\{P^{-1}(a)\}_{a \in B}$ is also a covering of X. That is,
if $x \in X$ then $x \in P^{-1}(a)$ for some $a \in B$. But we know
from a previous result that $M(R,B) \neq \emptyset$, since B is finite
and R is complete, reflexive and transitive. Thus a point
$x^* \in M(R,B)$ exists. Now consider any arbitrary point $y \in X$.
Either y is an element of $M(R,B)$ or it is not. If $y \in M(R,B)$ then
$y \in B$ and so, by definition, $x^*Ry$. If $y \notin M(R,B)$, since
$\{P^{-1}(a)\}_{a \in B}$ covers X there is some $a \in B$ such that
$y \in P^{-1}(a)$. This means that $aPy$ and hence $aRy$. However,
since $x^* \in M(R,B)$ we know that $x^*Ra$. Since R is transitive on
X, this implies that $x^*Ry$. Thus, we have shown that for all
$y \in X$, $x^*Ry$. This means that $x^* \in M(R,X)$, contradicting the
assumption and establishing the non-emptiness of $M(R,X)$. □


It is important to note that the theorem only establishes sufficient, 
not necessary conditions, for a non-empty maximal set. In particu- 
lar, we will encounter situations where X is either unbounded or not 
closed and R is discontinuous. In each of these possibilities, the non- 
emptiness of M(R, X) has to be established by other means. Violations 
of the compactness of X will generally require stronger assumptions 
about R while violations of continuity will require more structure on 
X. 


2.2. Uniqueness of M(R, X). Since we develop choice models
to make predictions about behavior, it is certainly preferable that the
model produce a single prediction, rather than a range of possible
outcomes. Thus, it is valuable to know whether or not M(R, X) has a
unique element or whether a larger set of choices is consistent with
rational behavior.

When the choice set is finite, we can typically only guarantee a
unique element of M(R, X) by assuming that all preferences are strict.
With a finite choice set and no indifference, M(R, X) must have only
a single element if it is non-empty.

When the choice space is not finite, we can impose sufficient
structure to ensure that M(R, X) has only one element. We will need one
condition on X and one condition on R to attain this result. The first
condition is that X be a convex set. This assumption requires that if
x and y are in X, certain combinations of x and y must also be in X.


DEFINITION 2.14. $X \subset \mathbb{R}^n$ is convex if for any
$x, y \in X$ the point $\lambda x + (1-\lambda)y$ is in X for every
$\lambda \in [0,1]$.


The point $\lambda x + (1-\lambda)y$ is often called the convex
combination (or a weighted average) of x and y. As an example the set
$[0,1]$ is convex because for any two points in the set, any point in
between these two points is also in the set. Alternatively,
$X = [0, \tfrac{1}{3}] \cup [\tfrac{2}{3}, 1]$ is not, since
$\tfrac{1}{3}\lambda + \tfrac{2}{3}(1-\lambda) \notin X$ for any
$\lambda \in (0,1)$. Thus, convexity requires that there are no “holes”
in the outcome set. When the outcome has more than one dimension,
convexity also requires that its surface not have any appendages.
Think about your hand. Convex combinations of points on your thumb and
index finger are not part of it.^7
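As an illustration of Definition 2.14, the sketch below searches a grid of weights for a convex combination that falls outside a set built from closed intervals. It is only a sketch under stated assumptions: the helper names `in_X` and `violates_convexity` are hypothetical, the set with a gap is chosen for illustration, and a grid search can exhibit a violation of convexity but cannot prove that a set is convex.

```python
from fractions import Fraction as F

def in_X(x, intervals):
    """True if x belongs to a union of closed intervals [lo, hi]."""
    return any(lo <= x <= hi for lo, hi in intervals)

def violates_convexity(x, y, intervals, steps=100):
    """Return a weight lam in (0,1) with lam*x + (1-lam)*y outside X, else None."""
    for k in range(1, steps):
        lam = F(k, steps)
        if not in_X(lam * x + (1 - lam) * y, intervals):
            return lam
    return None

unit = [(F(0), F(1))]                           # the interval [0, 1]
gap = [(F(0), F(1, 3)), (F(2, 3), F(1))]        # a set with a "hole"

print(violates_convexity(F(0), F(1), unit))         # None
print(violates_convexity(F(1, 3), F(2, 3), gap))    # 1/100
```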

We will see that an important property is that preferences be convex 
as well. 


DEFINITION 2.15. Preference R on the convex set X is strictly
convex if for any distinct points $x, y \in X$, if $xRy$ then
$[\lambda x + (1-\lambda)y] P y$ for any $\lambda \in (0,1)$.


Essentially, convex preferences have the property that if the agent
prefers x to y she should also prefer convex combinations of x and y
to y. Strict convexity goes a step further. Even if the agent is only
indifferent between x and y, she should still prefer the convex
combination to either x or y. We leave as an exercise that the strict
convexity of R implies that the lower contour sets $P^{-1}(x)$ are
convex. Since the lower contour sets are convex, they cannot have
holes, appendages, or flat spots.

The following result is easy to establish. 


THEOREM 2.4. If X is convex and R on X is strictly convex, then
if M(R, X) is non-empty, it contains a single element.


PROOF. By way of contradiction assume that X is convex, R
is strictly convex, and two distinct policies x, y are both in M(R, X).
For arbitrary $\lambda \in (0,1)$ the point $[\lambda x + (1-\lambda)y]$
is in X since X is convex. Since $x \in M(R,X)$ we have $xRy$, so the
strict convexity of R implies $[\lambda x + (1-\lambda)y] P y$. But this
contradicts the assumption that $y \in M(R,X)$. Thus the result is
established. □

^7 Game theorists spend a lot of time contemplating such ironies.


The importance of the last two theorems is clear. When the choice 
set is compact and a weak order is lower continuous, a “rational” choice 
exists. When the choice set is convex and the preference ordering is 
strictly convex, any optimal choice is unique. 


3. Utility Theory 


So far our model of choice and rationality is based on the use of 
binary preferences and the maximal set. However, binary operators 
are hard to work with except in the most trivial models. Numbers 
on the other hand are easy to work with. So if we can associate a 
number with each element of the outcome set, then we can just use 
the $\geq$ operator to compare alternatives. In this section we explore
the conditions under which it is possible to represent outcome sets as
sets of real numbers so that we can use $\geq$ as the preference
operator. In other words, we would like to represent preferences using
a utility function (a real valued function with domain X) such that


    $u(x) \geq u(y)$ implies $xRy$
    $u(x) > u(y)$ implies $xPy$
    $u(x) = u(y)$ implies $xIy$


The idea of utility has been the subject of numerous philosophical and
moral debates over the past 300 years, but again we will use a narrow
definition. Utilities are simply numerical representations of
preferences for which $\geq$ is the appropriate preference operator; we
imbue them with no additional normative content.

At our current level of generality, utility functions are ordinal as
they are used only to rank alternatives. In particular, they are not
used to tell us how much something is preferred to something else. The
value $u(x) - u(y)$ has no meaning, because any function w such that
$w(x) \geq w(y)$ if and only if $u(x) \geq u(y)$ represents exactly the
same preferences as u. This indicates that comparing utilities across agents
is generally not a meaningful exercise. However, as we discuss in the 
next chapter, the standard model of choice under uncertainty presumes 
that utility functions contain more than simple ordinal information. 
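The ordinal nature of utility can be made concrete with a small Python sketch. The outcome labels and utility numbers below are hypothetical, chosen only for illustration; the point is that a strictly increasing transformation (here the square root) leaves the ranking untouched while changing comparisons of utility differences.

```python
import math

u = {"x": 0.0, "y": 1.0, "z": 3.0}             # hypothetical utilities
w = {k: math.sqrt(v) for k, v in u.items()}    # w = f(u), f strictly increasing

def ranking(util):
    """Outcomes sorted from most to least preferred."""
    return sorted(util, key=util.get, reverse=True)

print(ranking(u) == ranking(w))                # True: same ordinal content
print(u["z"] - u["y"] > u["y"] - u["x"])       # True under u
print(w["z"] - w["y"] > w["y"] - w["x"])       # False under w
```

Because u and w represent the same preferences, the disagreement in the last two lines shows why the size of a utility difference carries no information at this level of generality.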

The following is a formal definition of a utility function.


DEFINITION 2.16. Given X and R on X we say the utility function
$u : X \to \mathbb{R}^1$ represents R if for all $x, y \in X$,
$u(x) \geq u(y)$ iff $xRy$.


From this definition it is quite easy to show that $u(x) > u(y)$ if
and only if $xPy$ and $u(x) = u(y)$ if and only if $xIy$. When X is
finite the existence of a utility representation of R hinges only on R
being complete, reflexive, and transitive.

Just as we did in the last section, we wish to characterize the agent’s
optimal choice. We say that x is a maximizer of
$u : X \to \mathbb{R}^1$ if $u(x) \geq u(y)$ for all $y \in X$. As the
next result shows, the existence of a maximizer and the non-emptiness
of M(R, X) are equivalent.


THEOREM 2.5. If the function $u(\cdot)$ is a utility representation of
R on X then $M(R,X) = \arg\max_{x \in X} \{u(x)\}$.


PROOF. To show that $M(R,X) \subset \arg\max_{x \in X}\{u(x)\}$,
assume that $u(\cdot)$ represents R on X and that $x' \in M(R,X)$. The
latter assumption implies that $x'Ry$ for all $y \in X$. This and the
former assumption imply that $u(x') \geq u(y)$ for all $y \in X$. Thus
$x' \in \arg\max_{x \in X}\{u(x)\}$. To show that
$\arg\max_{x \in X}\{u(x)\} \subset M(R,X)$, assume that $u(\cdot)$
represents R on X and that $x' \in \arg\max_{x \in X}\{u(x)\}$. The
latter assumption implies that $u(x') \geq u(y)$ for all $y \in X$.
This and the former assumption imply that $x'Ry$ for all $y \in X$.
Thus $x' \in M(R,X)$. □
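For a finite outcome set, the equality in Theorem 2.5 can be verified directly. The sketch below is illustrative only: the outcome set, the utility function, and the helper names `maximal_set` and `argmax_set` are assumptions made for the example, with the relation R defined from u exactly as in Definition 2.16.

```python
def maximal_set(X, R):
    """M(R, X): the elements x of X such that x R y for every y in X."""
    return {x for x in X if all(R(x, y) for y in X)}

def argmax_set(X, u):
    """The set of maximizers of u over the finite set X."""
    best = max(u(x) for x in X)
    return {x for x in X if u(x) == best}

X = {-2, -1, 0, 1, 2, 3}              # hypothetical finite outcome set
u = lambda x: -abs(x - 0.5)           # hypothetical utility, tied at 0 and 1
R = lambda x, y: u(x) >= u(y)         # the relation that u represents

print(maximal_set(X, R) == argmax_set(X, u))   # True
print(sorted(maximal_set(X, R)))               # [0, 1]
```

The tie between 0 and 1 shows that the maximal set need not be a singleton; the theorem only says the two sets coincide.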


If X is finite and R is complete, reflexive and transitive, we know 
that M(R,X) is non-empty (by theorem 1), thus a maximizer of u(x) 
must exist. However, if X is not finite, further conditions on X and 
the function are needed to ensure the existence of maximizers. In 
the next advanced section we consider utility functions on non-finite 
outcome spaces. 


4. Utility Representations on Continuous Outcome Spaces*


We now review some basic properties of functions. The first desir- 
able property of utility functions is continuity. 


DEFINITION 2.17. We say a function $f : X \to \mathbb{R}^1$ is
continuous if for every $x \in X$ the following is true: For every
$\varepsilon > 0$ there exists some $\delta > 0$ such that if
$\|x - y\| < \delta$ then $|f(x) - f(y)| < \varepsilon$.


As is often taught to high school students, a continuous function 
is one that you can draw without lifting the pencil. Substantively, 
a continuous utility function is one that produces close utilities for 
outcomes that are close together. 

The following sufficient conditions on preferences ensure that a
continuous utility representation exists.


THEOREM 2.6. (Debreu 1959) If $X \subset \mathbb{R}^n$ and R is
complete, reflexive, transitive, and continuous, then there exists a
continuous utility function $u : X \to \mathbb{R}^1$ that represents R.


We will not undertake the proof of this claim. However, the converse
is not difficult to establish, and we leave it as an exercise. A result
analogous to Theorem 2.3 is the following.


THEOREM 2.7. If $X \subset \mathbb{R}^n$ is compact and
$u : X \to \mathbb{R}^1$ is continuous then a maximizer exists.


This result is sometimes known as the Weierstrass Theorem. We do 
not prove the result here (see Royden for a proof), since Theorem 2.3 is 
actually a result showing that only lower continuity and compactness 
are needed. 

As we pointed out earlier, utility functions are arbitrary. Some
texts call this an ordinal notion of utility as opposed to a cardinal
notion of utility. There is nothing interesting about the particular
value of a utility function at a specific point $x \in X$. All that
matters is the ordering of $u(x)$ and $u(y)$ for any two $x, y \in X$.
We say that $f : \mathbb{R}^1 \to \mathbb{R}^1$ is a strictly
increasing function if for all $x, y \in \mathbb{R}^1$, $x > y$ implies
that $f(x) > f(y)$. Utility functions are defined only up to a strictly
increasing transformation. This means that if $u : X \to \mathbb{R}^1$
represents R on X then $f \circ u : X \to \mathbb{R}^1$ also represents
R on X, where $f \circ u$ is defined by $(f \circ u)(x) = f(u(x))$.
Thus, rescaling a utility function by a strictly increasing
transformation is of no consequence.

Thus far, we have not established any results that allow us to find
the maximizer of a utility function. Fortunately, if we assume that 
utility functions are differentiable, the tools of calculus will allow us to 
characterize optimal choices. The mathematical appendix reviews key 
results from calculus. 


5. Spatial Preferences 


In most applications in economics, outcome spaces are denominated
in money (incomes, wealth, wages, profits, etc.) or commodities
(widgets, gizmos, chili burritos). It is sensible to assume that larger
outcomes are preferred to smaller outcomes (except perhaps in the case
of chili burritos). In other words, many of the preferences considered
in economics are non-satiable in that agents either believe more is
always better (e.g. money) or less is always better (e.g. air
pollution). In political game theory, however, many of the outcomes we
wish to study are policies such as taxes, welfare benefits, and
abortion restrictions where at least some agents have a most preferred
outcome that is neither zero nor infinite. A voter’s utility may be
increasing in tax rates below some level and decreasing for higher
levels. A voter may prefer restrictions on abortion as stringent as
outlawing it in the third trimester but no more so. Thus, it is often
necessary to assume that political actors have satiable preferences.
Formally, we can say an agent has such preferences when M(R, X)
contains elements that are interior to the outcome space X. Similarly,
preferences are satiable when the maximizer of
$u : X \to \mathbb{R}^1$ is in the interior of X. Figure 2.1
illustrates the differences between satiable and non-satiable
preferences.


Insert Figure 2.1 Here 


The most common application of satiable preferences is the spatial 
model where it is assumed that policy outcomes can be represented 
as points lying in a subset of $\mathbb{R}^d$. In principle, one could specify very
general preferences over this space, but in practice (and the remainder 
of this book) it is generally assumed that voters have single-peaked 
and symmetric preferences. We will discuss single peakedness in more 
detail in the chapter on social choice, but for now we will simply note 
that it implies that the agent’s maximal set has a single element and 
that the utility function has a single maximizer. This most preferred 
policy outcome is known as the agent’s ideal point. The assumption 
of symmetry requires that the agent’s utility declines at the same rate 
regardless of direction. This implies that preferences are a decreasing 
function of the distance between the policy outcome and the agent’s 
ideal point. 

If we assume that the policy space is one-dimensional, single-peaked,
symmetric preferences are represented by utility functions of the form
$u_i(x) = h(-|x - z_i|)$ where $z_i$ is agent i’s ideal point and
$h : \mathbb{R}^1 \to \mathbb{R}^1$ is an increasing function. The two
most popular examples are the linear, $u_i(x) = -|x - z_i|$, and
quadratic, $u_i(x) = -(x - z_i)^2$, utility functions. These functions
are plotted in Figure 2.2.
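The two functional forms can be written down directly. The Python sketch below uses a hypothetical ideal point and hypothetical function names; it simply checks the two properties the text attributes to these utilities, namely a single peak at the ideal point and symmetry around it.

```python
def linear_utility(x, z):
    """u_i(x) = -|x - z_i| for an agent with ideal point z."""
    return -abs(x - z)

def quadratic_utility(x, z):
    """u_i(x) = -(x - z_i)^2 for an agent with ideal point z."""
    return -(x - z) ** 2

z = 0.25                                   # hypothetical ideal point
grid = [k / 4 for k in range(-4, 9)]       # outcomes from -1.0 to 2.0

for u in (linear_utility, quadratic_utility):
    peak = max(grid, key=lambda x: u(x, z))
    symmetric = u(z + 0.5, z) == u(z - 0.5, z)
    print(u.__name__, peak, symmetric)     # peak is 0.25, symmetric is True
```

The quadratic form penalizes distant outcomes more heavily than the linear form, but both rank any pair of outcomes by distance from z.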


Insert Figure 2.2 Here 


In higher dimensional applications, $X \subset \mathbb{R}^d$, distances
are measured by the Euclidean norm, defined (with superscripts indexing
coordinates) as

    $\|x - z_i\| = \sqrt{\sum_{j=1}^{d} (x^j - z_i^j)^2}$

Thus, symmetric, single-peaked preferences take the form of

    $u_i(x) = h(-\|x - z_i\|)$

where again, $z_i \in \mathbb{R}^d$ is the ideal point of agent i, and
$h : \mathbb{R}^1 \to \mathbb{R}^1$ is an increasing function.

It is difficult to visualize utility functions over multidimensional
spaces. However, for 2 dimensions graphical analysis is simplified by
the fact that each agent’s preferred sets, i.e.
$P(y) = \{x \in X \mid xRy\}$, form circular regions centered on the
agent’s ideal point. Similarly, the set of points for which the agent
is indifferent to y is a circle containing y centered on the ideal
point. These sets are illustrated in Figure 2.3. For any indifference
curve, an agent prefers an outcome inside the circle to any that lies
outside it.


Insert Figure 2.3 Here 


One of the reasons that single-peaked, symmetric preferences are so
popular in applied political game theoretic models is the ease with
which the predicted choices of agents can be characterized. As long as
one is willing to make the appropriate assumptions, choice over a pair
of outcomes can be characterized by an agent’s ideal point and a
“cutpoint” in $\mathbb{R}^1$ or a “cutting plane” in $\mathbb{R}^d$.

To see this, consider an agent with symmetric single-peaked
preferences over $\mathbb{R}^1$. Thus, agent i prefers x to y if and
only if $h(-|x - z_i|) > h(-|y - z_i|)$. Because h is increasing, this
is equivalent to $|x - z_i| < |y - z_i|$. Assuming that $x > y$, this
condition becomes

    $z_i > c \equiv \frac{x + y}{2}$

Conversely, $yPx$ if and only if $z_i < c$. Thus, given a set of agents
and outcomes $x > y$, the model predicts that all agents with ideal
points greater than the midpoint of x and y prefer x and those with
ideal points lower than the midpoint prefer y. Note that this prediction
is completely independent of h.
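The claim that the cutpoint prediction does not depend on h can be checked with a short Python sketch. The outcomes, the grid of ideal points, and the three choices of h are hypothetical; integer values are used so that the comparison at each ideal point is exact.

```python
import math

def prefers_x(z, x, y, h):
    """True if an agent with ideal point z and utility h(-|. - z|) strictly prefers x to y."""
    return h(-abs(x - z)) > h(-abs(y - z))

x, y = 8, 2                       # hypothetical outcomes with x > y
c = (x + y) / 2                   # the cutpoint (midpoint of x and y)

# Three increasing choices of h. The quadratic one is increasing on t <= 0,
# which is the only range on which h is evaluated here.
shapes = {
    "linear": lambda t: t,
    "quadratic": lambda t: -(t ** 2),
    "exponential": math.exp,
}

ideal_points = [z for z in range(0, 11) if z != c]   # skip the indifferent agent
for name, h in shapes.items():
    same = all(prefers_x(z, x, y, h) == (z > c) for z in ideal_points)
    print(name, same)             # True for every choice of h
```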

This logic extends to $\mathbb{R}^d$ as well. Now agent i prefers x to
y if and only if $h(-\|x - z_i\|) > h(-\|y - z_i\|)$. Now we can define
a separating hyperplane as follows. Let
$C = \{c : \|x - c\| = \|y - c\|\}$. This hyperplane is equivalent to
the cutpoint in $\mathbb{R}^1$. It divides the ideal points into those
who prefer x to y and those who prefer y to x. Again, armed only with
knowledge of ideal points and C, we can confidently characterize the
choices of the agents.
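Expanding the squared norms in the definition of C shows that its points satisfy the linear equation $(x - y) \cdot c = (\|x\|^2 - \|y\|^2)/2$, so an agent's choice depends only on which side of this hyperplane her ideal point lies. The following Python sketch checks that equivalence on a handful of hypothetical ideal points in two dimensions; the function names are illustrative, and squared distances with integer coordinates keep the comparisons exact.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((aj - bj) ** 2 for aj, bj in zip(a, b))

def side_of_C(z, x, y):
    """Positive on x's side of C, negative on y's side, zero on C itself."""
    normal = [xj - yj for xj, yj in zip(x, y)]
    offset = (sum(xj * xj for xj in x) - sum(yj * yj for yj in y)) / 2
    return sum(nj * zj for nj, zj in zip(normal, z)) - offset

def prefers_x(z, x, y):
    """With symmetric single-peaked preferences, x is preferred iff it is closer to z."""
    return sq_dist(z, x) < sq_dist(z, y)

x, y = (2, 0), (0, 2)                                  # hypothetical outcomes
ideal_points = [(3, 1), (1, 3), (0, -1), (-2, 0), (5, 5)]
for z in ideal_points:
    print(z, prefers_x(z, x, y), side_of_C(z, x, y))
```

The last ideal point lies on C, so that agent is indifferent and the sign of `side_of_C` is zero.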


6. Exercises 


EXERCISE 2.1. Prove the following: Assume X is finite and R is
a complete and reflexive binary relation on X. Then
$M(R,S) \neq \emptyset$ on any $S \subset X$ (except $S = \emptyset$)
iff R is acyclic.


EXERCISE 2.2 (*). Prove that R is strictly convex if and only if its
lower contour sets $P^{-1}(x)$ are convex.


EXERCISE 2.3 (*). Show that if $X \subset \mathbb{R}^n$ and the
continuous utility function $u : X \to \mathbb{R}^1$ represents the
binary ordering R on X then R is complete, reflexive, transitive, and
continuous.


EXERCISE 2.4 (*). Use Theorem 2.3 to prove the Weierstrass theorem.


EXERCISE 2.5 (*). Use Definition 2.12 to show that [0,1] is com- 
pact. 


EXERCISE 2.6. For the following utility functions, describe the
preferred set. You may do this either graphically or by formally
characterizing $P(x) = \{y : yRx\}$ for all $x \in X$. Plot the utility
curve if possible.


(1) $u(x) = -|1 - x|$ for $x \in [0,1]$
(2) $u(x) = -x^2$ for $x \in [0,1]$
(3) $u(x) = \sqrt{x}$ for $x \in [0,1]$
(4) $u(x) = -\alpha x_1^2 - (1-\alpha) x_2^2$ for $x \in \mathbb{R}^2$


EXERCISE 2.7. Let $x = (x_1, x_2)$ and $y = (y_1, y_2)$ be two outcomes
from $\mathbb{R}^2$.


(1) Assuming that all agents have single-peaked and symmetric
preferences, compute the separating hyperplane H as a function
of $x_1$, $x_2$, $y_1$, and $y_2$. Verify that it is a straight line.

(2) Assume that each agent has non-symmetric preferences given
by $-\alpha(x^1 - z_i^1)^2 - (1-\alpha)(x^2 - z_i^2)^2$ for
$x \in \mathbb{R}^2$. What does
$C = \{c \mid \alpha(x^1 - c^1)^2 + (1-\alpha)(x^2 - c^2)^2 = \alpha(y^1 - c^1)^2 + (1-\alpha)(y^2 - c^2)^2\}$
look like now? Does it divide the ideal points
of agents who prefer x to y from those who prefer y to x?


EXERCISE 2.8. Assume that an agent has spatial preferences R over
$\mathbb{R}^d$ represented by the utility function
$u_i(x) = h(-\|x - z_i\|)$. Prove that for any
$X \subset \mathbb{R}^d$, M(R, X) is non-empty if $z_i$ is finite. Show
that as long as X is convex, M(R, X) has a unique element.


CHAPTER 3 


Choice Under Uncertainty 


In this chapter we drop the assumption that individuals can per- 
fectly predict the consequences of their actions. Instead we assume 
that outcomes arise probabilistically from the choice of actions i.e. that 
certain actions increase or decrease the likelihood of particular out- 
comes. Further, individuals are assumed to know which actions are 
most likely to produce which sorts of outcomes. Recall the example 
from the last chapter where A = {send in the troops, try negotiating, 
do nothing} and X = {win major concessions, win minor concessions, 
status quo}. The agent may believe that major concessions are more 
likely when the troops are deployed than when negotiation is initiated. 
Thus, she would have to trade off this likelihood of generating a better 
outcome against her costs of taking each action. Deploying the troops 
would be rational if it is much more likely to lead to major concessions, 
the additional concessions are valuable to the agent, or if the costs of 
deployment are low. These are the basic trade-offs underlying the 
classical theory of choice under uncertainty. 

There are two key elements of this approach. The first is the 
concept of beliefs which are modeled as probability distributions or 
“lotteries” over the outcomes associated with each action. The second 
is the specification of payoffs associated with each outcome. These 
payoffs are known as von Neumann-Morgenstern utility functions in 
honor of two of the pioneers of classical decision theory. As we shall 
see, the von Neumann-Morgenstern functions rely on a much stronger 
concept of utility than the ordinal functions discussed in chapter 2. 


1. The Finite Case 


Our presentation begins with the case of finite numbers of actions
and outcomes. As in the previous chapter, we denote the feasible
actions and outcomes as sets $A = \{a_1, \ldots, a_I\}$ and
$X = \{x_1, \ldots, x_J\}$. However, we now assume that actions and
outcomes are linked probabilistically. To formalize this assumption, we
assume that the outcome depends both on the action taken and the “state
of the world”, s. From the point of view of the agent, s is a random
variable like rainfall on election day or missile precision in a war.
In decision-theoretic models (or in game theoretic models in which
agents choose actions randomly), it can also represent the actions of
other agents. We denote the set of states as
$S = \{s_1, \ldots, s_K\}$. We assume that agents have beliefs about
the likelihood of each state represented by the probability function
$\pi(s_k) = \pi_k$. These probabilities have to satisfy the basic
axioms of probability theory: they have to be between zero and one,
inclusive, and they must sum to one. Formally, we require


    $0 \leq \pi_k \leq 1$
    $\pi_1 + \pi_2 + \cdots + \pi_K = \sum_{k=1}^{K} \pi_k = 1$

Given this setup, we can formalize the linkage between actions, states,
and outcomes with an outcome function defined as
$\chi(a,s) : A \times S \to X$.^1 As an example, consider Table 3.1
which specifies an outcome for each combination of states and actions:


Table 3.1
A\S   | $s_1$ | $s_2$ | $s_3$
$a_1$ | $x_1$ | $x_1$ | $x_2$
$a_2$ | $x_1$ | $x_2$ | $x_3$

In this example, outcome $x_1$ occurs in state $s_1$ regardless of
action. In states $s_2$ and $s_3$, the outcome depends on the action
chosen by the agent. From the agent’s perspective, however, it is not
the state that matters so much as the likelihood of getting particular
outcomes following each action. Since the agent does not know the state
when she chooses $a_i$, the probability of receiving outcome $x_j$ is
the probability that the state takes on a value s such that
$\chi(a_i, s) = x_j$. We let $p_{ij}$ be the probability that outcome
$x_j$ occurs following action $a_i$.

Note that we can easily compute these probabilities from Table 3.1:
$p_{11} = \pi_1 + \pi_2$, $p_{12} = \pi_3$, $p_{13} = 0$,
$p_{21} = \pi_1$, $p_{22} = \pi_2$, and $p_{23} = \pi_3$.
Thus, the general formula for these probabilities is


    $p_{ij} = \sum_{\{k : \chi(a_i, s_k) = x_j\}} \pi_k$


^1 Because we only focus on outcomes that can occur for some
combination of a and s, we require that the total number of
action-state combinations be no less than the number of outcomes, or
$I \cdot K \geq J$.

The probabilities $p_{ij}$ inherit the following properties from the
$\pi_k$:


    $0 \leq p_{ij} \leq 1$
    $p_{i1} + p_{i2} + \cdots + p_{iJ} = \sum_{j=1}^{J} p_{ij} = 1$ for each i.
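The general formula for the outcome probabilities can be sketched in a few lines of Python. The outcome function below is the one implied by the probabilities listed in the text ($p_{11} = \pi_1 + \pi_2$, $p_{12} = \pi_3$, and so on); the numerical beliefs over states and the names `chi`, `pi`, and `lottery` are hypothetical choices made only for this illustration.

```python
from fractions import Fraction as F

# Outcome function chi(a, s), consistent with the probabilities in the text.
chi = {
    ("a1", "s1"): "x1", ("a1", "s2"): "x1", ("a1", "s3"): "x2",
    ("a2", "s1"): "x1", ("a2", "s2"): "x2", ("a2", "s3"): "x3",
}
pi = {"s1": F(1, 2), "s2": F(1, 3), "s3": F(1, 6)}   # hypothetical beliefs
outcomes = ("x1", "x2", "x3")

def lottery(action):
    """p_ij for a fixed action: sum pi_k over the states s_k mapped to x_j."""
    p = {x: F(0) for x in outcomes}
    for s, prob in pi.items():
        p[chi[(action, s)]] += prob
    return p

for a in ("a1", "a2"):
    p = lottery(a)
    print(a, [str(p[x]) for x in outcomes], sum(p.values()))
# a1 ['5/6', '1/6', '0'] 1
# a2 ['1/2', '1/3', '1/6'] 1
```

Each lottery inherits non-negativity and summation to one from the beliefs over states, as stated above.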


In the remainder of this chapter, we simplify the notation by
suppressing the dependence of the outcome probabilities on s to focus
solely on $p_{ij}$. However, in later chapters, we will use the
action/state representation of the agent’s problem more explicitly.

Also to keep notational clutter to a minimum, we define the vector
$p_i = (p_{i1}, \ldots, p_{iJ})$ and refer to it as the lottery over
the outcomes associated with action $a_i$. Because of the
correspondence between the action and the lottery it generates, we will
refer interchangeably to an agent choosing an action $a_i$ or simply
choosing the lottery $p_i$. Finally, let P be the set of all lotteries.
Given that there are J possible outcomes, the set P consists of the set
of vectors of length J that satisfy the above conditions (each element
is between 0 and 1, and all coordinates sum to 1). This set is
sometimes denoted $\Delta^J$ and termed the J dimensional simplex.^2
For two dimensions, the simplex is simply the straight line from
coordinate (0,1) to (1,0) as in Figure 3.1. For three dimensions, it is
the triangular segment of the plane through (1,0,0), (0,1,0), and
(0,0,1) as in the lower panel of Figure 3.1.


Insert Figure 3.1 Here 


Another easy way to visualize lotteries is to model them as trees
as in Figure 3.2. Beginning from the initial node, each branch
corresponds to a particular outcome and is labeled with the probability
of that outcome. Thus, we can see that lottery p generates a larger
probability of $x_1$ and a lower probability of $x_3$ than the lottery
q, while both lotteries generate the same probability of $x_2$. To
build some intuition for what follows, consider how an agent might
choose between an action that generated p and one that generated q.
First, it would seem unreasonable for the agent to base her decision on
her preferences for $x_2$ since both lotteries generate this outcome
with identical probabilities. Since the difference between the
lotteries is the relative likelihood of $x_1$ and $x_3$, it would also
seem that a rational agent would choose p only if $x_1 R x_3$. Thus,
using these two intuitive arguments (which we will formalize shortly),
we can speculate that the agent would choose p if $x_1 P x_3$, q if
$x_3 P x_1$, and either lottery if $x_1 I x_3$.


^2 We refer those readers who are unfamiliar with vectors and
coordinate systems to the mathematical appendix.

Insert Figure 3.2 Here


One feature of the preceding example that facilitated our intuitive
prediction is that the lotteries involved are simple in the sense that
each outcome is associated with a single probability number. However,
it is often necessary to consider more complicated situations where
agents must choose between lotteries over lotteries over lotteries....
Such situations are known as compound lotteries. We can formalize this
notion by defining a compound lottery over P by
$\{\alpha_1, \ldots, \alpha_I\}$ where $\alpha_i$ represents the
probability of playing lottery $p_i$. As an example, consider
how an agent might evaluate a lottery in which the agent gets to play 
p with probability + and q with probability 3. We'll call this lottery 
r =;p+q. Its tree representation is given in Figure 3.3. How should 
an agent choose between p, q, and r? First, note that the availability
of r should not change the preference ranking between p and q
so we need only consider comparisons of r versus p and r versus q. 
However, these comparisons would seem to be difficult because of r’s 
compound structure. Fortunately, preferences over r are easy to an- 
alyze. We assume that our agents only care about the probabilities 
associated with each outcome, not the paths travelled to reach those 
outcomes. Thus, the agent can compute that the probability of re- 
ceiving outcome x, is the probability of receiving lottery p (+) times 
+ plus the probability of receiving q (#) times +. The probabilities of 
$x_2$ and $x_3$ can be computed similarly. Since the agent can compute
a single probability number for each outcome, r can be represented as 
a simple lottery as in the second panel of Figure 3.3. Formally, any 


compound lottery $\{\alpha_1, \ldots, \alpha_I\}$ over P can be
represented as a simple lottery with the probability of $x_j$ given by
$\sum_{i=1}^{I} \alpha_i p_{ij}$.
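The reduction of a compound lottery to a simple one is a weighted sum of probability vectors, which the following Python sketch illustrates. The lotteries p and q and the mixing weights are hypothetical numbers chosen for the example; they are not the values shown in the book's figures.

```python
from fractions import Fraction as F

def reduce_compound(alpha, lotteries):
    """Simple lottery whose j-th entry is sum_i alpha_i * p_ij."""
    J = len(lotteries[0])
    return [sum(a * p[j] for a, p in zip(alpha, lotteries)) for j in range(J)]

p = [F(1, 2), F(1, 4), F(1, 4)]        # hypothetical lottery over (x1, x2, x3)
q = [F(1, 4), F(1, 4), F(1, 2)]        # hypothetical lottery over (x1, x2, x3)
alpha = [F(1, 2), F(1, 2)]             # hypothetical mixing probabilities

r = reduce_compound(alpha, [p, q])
print([str(v) for v in r], sum(r))     # ['3/8', '1/4', '3/8'] 1
```

Because the weights and each component lottery sum to one, the reduced vector is again a point in the simplex.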


Insert Figure 3.3 Here 


Given the reduction of r to a simple lottery, can we now say which
lottery the agent will prefer? Note that in the reduction of r, the 
probability of x2 is again $ as it is in p and q. So preferences over 
$x_2$ are again irrelevant and only the comparison of $x_1$ to $x_3$
matters. Since under r the outcome $x_1$ is more likely than $x_3$, any
agent for whom $x_1 P x_3$ will prefer r over q. However, it also seems
intuitive that such an agent would also prefer p to r since $x_1$ is
somewhat more likely under p.

This discussion of how an agent with preferences over the outcomes
X might evaluate different lotteries demonstrates the key features of
the theory of choice under uncertainty that is standard in classical
decision theory and non-cooperative game theory. Our goal is to
formalize a notion of weak preferences on P that allows us to deal with
more complicated problems involving choice under uncertainty. Just as
utility functions greatly simplify the analysis of choice under
certainty, the concept of expected utility will simplify the analysis
of choice under uncertainty. Now we are in a position to formalize some
of the intuition in the previous example. All of the intuition can be
succinctly summarized in four axioms about weak preferences R on P.


AXIOM 3.1. Completeness and Transitivity: The weak preference
relation R over P is complete and transitive.


AXIOM 3.2. Reduction of Compound Lotteries: For any
$\alpha \in [0,1]$ and $p \in P$, $p \, I \, [\alpha p + (1-\alpha)p]$.

AXIOM 3.3. Continuity: Let p, q and r be three lotteries in P.
The set of scalars $\alpha \in [0,1]$ such that
$[\alpha p + (1-\alpha)r] R q$ is a closed interval and the set of
scalars $\beta \in [0,1]$ such that $q R [\beta p + (1-\beta)r]$
is also a closed interval.^3


AXIOM 3.4. Independence: Let p, q and r be three lotteries in P.
For any scalar $\alpha \in (0,1)$, $pRq$ if and only if
$[\alpha p + (1-\alpha)r] R [\alpha q + (1-\alpha)r]$.

The substantive meaning of each of the axioms is pretty
straightforward. First, we have to assume, just as in the case of
outcomes, that any two lotteries can be compared and that preferences
over lotteries do not cycle. This axiom is critical in our example as
the predictions assume that the agent has well-behaved preferences over
$x_1$, $x_2$, and $x_3$. As in the last chapter, transitivity can be
extended to indifference and strict preference. The second axiom simply
formalizes Figure 3.3 and guarantees that agents care only about the
probabilities of the outcomes, and not the particular manner in which
the probabilistic process of reaching those outcomes is represented.^4

Axiom 3.3 is somewhat abstract but implies that small changes
in the probabilities of outcomes should not lead to large changes in the
preferences over lotteries. It requires that if $pPq$ then all lotteries
sufficiently close to p should also be preferred to q. If $pPq$, a
modification of p that adds a very small probability of a really bad
outcome will not reverse the preference ordering. The continuity axiom
has the following straightforward and useful implication.


³ The mathematical appendix has a detailed discussion of closed sets. For the
present purposes, however, it is sufficient to know that a closed interval $[a,b]$ is one
that includes points $a$ and $b$ and all points in between.

⁴ In some texts this assumption is implicit when authors define preferences over
lotteries. We choose to make the assumption explicit to highlight that this theory
ignores details regarding the representation of lotteries.


LEMMA 3.1. If $p \, R \, q \, R \, r$ then there exists some $\lambda \in [0,1]$ such that
$[\lambda p + (1-\lambda) r] \, I \, q$.


PROOF. Assume that $p \, R \, q \, R \, r$ and that for any $\lambda \in [0,1]$, we have
either $[\lambda p + (1-\lambda) r] \, P \, q$ or $q \, P \, [\lambda p + (1-\lambda) r]$. If $p \, I \, q$ or $r \, I \, q$, we
obtain a contradiction at $\lambda = 1$ or $\lambda = 0$ so we must have $p \, P \, q \, P \, r$. This
implies that the sets $\{\alpha : [\alpha p + (1-\alpha) r] \, R \, q\}$ and $\{\beta : q \, R \, [\beta p + (1-\beta) r]\}$
are non-empty. By the continuity axiom, these sets are closed. This
means that the first set contains a smallest element $\alpha$ and the second
set contains a largest element $\beta$. Since the strict preference is not
reflexive, we cannot have $[\lambda p + (1-\lambda) r] \, P \, q$ and $q \, P \, [\lambda p + (1-\lambda) r]$
for any particular value of $\lambda$. Thus we must have $\beta < \alpha$. But then at
$\lambda \in (\beta, \alpha)$ we have neither $[\lambda p + (1-\lambda) r] \, P \, q$ nor $q \, P \, [\lambda p + (1-\lambda) r]$,
contradicting the original hypothesis. □


Finally, consider the independence axiom which is perhaps the most
controversial.⁵ Suppose we have a preference ranking between two
lotteries. If we mix each of those lotteries with a third (using the 
same probabilities), the independence axiom holds that the preference 
ordering will be the same as those over the original lotteries. As an 
example, consider two lotteries. The first pays $100 with probability 
.5 and $0 otherwise. The second pays $75 for sure. If we compound
each of these lotteries with a .5 chance of $1,000,000 for sure and .5 
chance of playing the original lottery, the independence axiom says 
that the preferences over the compound lotteries should correspond to 
the original lotteries. Using the “tree” metaphor for lotteries, the 
independence axiom says that the comparison of two lotteries is based 
only on the comparison of the outcome branches that are distinct across 
lotteries. Thus, this axiom formalizes the intuition behind ignoring x2 
in our first example. The independence axiom is easy to extend to the 
case of indifference and strict preference. 


LEMMA 3.2. For any scalar $\alpha \in (0,1)$, $p \, I \, q$ if and only if
$[\alpha p + (1-\alpha) r] \, I \, [\alpha q + (1-\alpha) r]$.


PROOF. To show sufficiency, suppose that $p \, I \, q$. Then the independence
axiom requires both $[\alpha p + (1-\alpha) r] \, R \, [\alpha q + (1-\alpha) r]$ and
$[\alpha q + (1-\alpha) r] \, R \, [\alpha p + (1-\alpha) r]$. Thus, $[\alpha p + (1-\alpha) r] \, I \, [\alpha q + (1-\alpha) r]$
is the only possibility. The proof of necessity is very similar. □


LEMMA 3.3. For any scalar $\alpha \in (0,1)$, $p \, P \, q$ if and only if
$[\alpha p + (1-\alpha) r] \, P \, [\alpha q + (1-\alpha) r]$.


PROOF. To show sufficiency, suppose that $p \, P \, q$. Then the independence
axiom requires that $[\alpha p + (1-\alpha) r] \, R \, [\alpha q + (1-\alpha) r]$. To show that
indifference is inconsistent, assume that $[\alpha q + (1-\alpha) r] \, I \, [\alpha p + (1-\alpha) r]$.
By the previous lemma this implies that $q \, I \, p$, contradicting the assumption
that $p \, P \, q$. The proof of necessity is very similar. □


⁵ In some texts and articles the independence axiom is called the substitution
axiom.


An equally important but less direct implication of the indepen- 
dence axiom is the following lemma. 


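LEMMA 3.4. If $p \, R \, q$ and $\alpha \in (0,1)$, then $p \, R \, [\alpha p + (1-\alpha) q] \, R \, q$.


PROOF. Since $p \, R \, q$, we can use the reduction of compound lotteries,
the independence axiom, and transitivity to show that
$$p \, I \, [\alpha p + (1-\alpha) p] \, R \, [\alpha p + (1-\alpha) q] \, R \, [\alpha q + (1-\alpha) q] \, I \, q.$$ □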


This lemma just establishes that if we take a weighted average of 
two lotteries, the resulting lottery will have an intermediate preference 
ranking. Finally, the independence and continuity axioms have the fol- 
lowing implication which is crucial for the existence of expected utility 
functions. 


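LEMMA 3.5. Suppose the alternatives are indexed so that $x_1 \, R \, x_j$ for
all $j$ and $x_j \, R \, x_J$ for all $j$. Then for all $\alpha, \beta \in [0,1]$,
$[\alpha x_1 + (1-\alpha) x_J] \, R \, [\beta x_1 + (1-\beta) x_J]$ if and only if $\alpha \geq \beta$.


PROOF. Suppose $\alpha > \beta$. We can then write $\alpha x_1 + (1-\alpha) x_J$ as
$\gamma x_1 + (1-\gamma) [\beta x_1 + (1-\beta) x_J]$ where $\gamma = \frac{\alpha - \beta}{1 - \beta} \in (0,1)$. From Lemma
3.4, $x_1 \, R \, [\beta x_1 + (1-\beta) x_J]$. Applying Lemma 3.4 again,

$$[\gamma x_1 + (1-\gamma) [\beta x_1 + (1-\beta) x_J]] \, R \, [\beta x_1 + (1-\beta) x_J].$$

The proof of necessity is identical to the above with the roles of $\alpha$ and
$\beta$ reversed. □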


Sensibly, when we compare lotteries over the best and worst outcomes,
we prefer the one with the greatest likelihood of producing the
best outcome. With these axioms and lemmas, we can now prove that
preferences over lotteries can be represented by expected utility
functions. We state the theorem in terms of preferences over lotteries.
Recall that actions induce lotteries over outcomes and so an analogous 
statement can be made about preferences over actions. 


THEOREM 3.1. (von Neumann-Morgenstern) If axioms 3.1-3.4 hold,
then there exists a function $u(x_j)$ (which assigns a number $u_j$ to each
outcome) such that

i) the expected utility of lottery $p_i$ (which is induced by action $i$) is
given by

$$EU(p_i) = p_{i1} u_1 + p_{i2} u_2 + \dots + p_{iJ} u_J = \sum_{j=1}^{J} p_{ij} u_j;$$

ii) $p_i \, R \, p_k$ (i.e. $a_i \, R \, a_k$) if and only if $EU(p_i) \geq EU(p_k)$.


The function $u(x_j)$ is sometimes called a Bernoulli utility function,
to distinguish it from the expected utility function $EU(p)$. It is
important to note that we are talking about two different types of utility
functions. The Bernoulli (or lower case) functions are defined over
outcomes. The expected utility (capitalized) functions are defined over
lotteries. A subtle point is that Theorem 3.1 starts with preferences
over lotteries that satisfy the four axioms and states that we can
construct an expected utility function over lotteries that has a particular
form: the expected utility of a lottery is nothing more than the
expected value of the lottery given the values of the outcomes specified by
the Bernoulli utility function.

The expected utility of a lottery is simply the average of the utilities
over outcomes weighted by the probabilities of each outcome.
For example, if we assigned utilities $u(x_1)$, $u(x_2)$, and $u(x_3)$ to
outcomes $x_1$, $x_2$, and $x_3$, then the expected utility of a lottery
$p = (p_1, p_2, p_3)$ would be $EU(p) = p_1 u(x_1) + p_2 u(x_2) + p_3 u(x_3)$ while
that of $q = (q_1, q_2, q_3)$ would be $EU(q) = q_1 u(x_1) + q_2 u(x_2) + q_3 u(x_3)$.
One of the most attractive properties of expected utility functions is that
they are linear in the outcome utilities. Among other things this implies that
for the compound lottery $r = \frac{1}{2} p + \frac{1}{2} q$,
$$EU(r) = EU(\tfrac{1}{2} p + \tfrac{1}{2} q) = \tfrac{1}{2} EU(p) + \tfrac{1}{2} EU(q) = \sum_{j=1}^{3} \tfrac{1}{2}(p_j + q_j) \, u(x_j),$$
which is exactly what one gets from computing the expected utility of
the reduced lottery.
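
The linearity property is easy to check numerically. The following is a minimal sketch, not part of the formal development: the probabilities and Bernoulli utilities are hypothetical values chosen for illustration (they are not the lotteries of the earlier example), and the helper names `expected_utility` and `mix` are our own.

```python
from fractions import Fraction as F

def expected_utility(lottery, utils):
    """EU(p) = sum_j p_j * u_j for a lottery given as a list of probabilities."""
    assert sum(lottery) == 1, "probabilities must sum to one"
    return sum(p_j * u_j for p_j, u_j in zip(lottery, utils))

def mix(alpha, p, q):
    """Reduced form of the compound lottery alpha*p + (1 - alpha)*q."""
    return [alpha * p_j + (1 - alpha) * q_j for p_j, q_j in zip(p, q)]

# Hypothetical numbers, for illustration only.
utils = [F(1), F(3, 5), F(0)]        # u(x1), u(x2), u(x3)
p = [F(1, 2), F(1, 4), F(1, 4)]
q = [F(1, 4), F(1, 2), F(1, 4)]
r = mix(F(1, 2), p, q)               # r = (1/2)p + (1/2)q

# Expected utility of the reduced lottery ...
eu_reduced = expected_utility(r, utils)
# ... and the same mixture of the two expected utilities.
eu_mixed = F(1, 2) * expected_utility(p, utils) + F(1, 2) * expected_utility(q, utils)
print(eu_reduced, eu_mixed, eu_reduced == eu_mixed)
```

Exact fractions are used so that the equality is not blurred by floating point rounding.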

We do not prove the von Neumann-Morgenstern theorem formally
in this section, but we can sketch it in the case of three outcomes.
The proof of the general result is very similar but uses mathematical
induction to extend to an arbitrary number of alternatives. Let
$X = \{x_1, x_2, x_3\}$ where $x_1 \, R \, x_2 \, R \, x_3$ and assume that at least one of
the preferences is strict; $x_1 \, I \, x_2 \, I \, x_3$ is a trivial case. We will
represent a lottery over $X$ as a vector $(p_1, p_2, p_3)$. From Lemma 3.1, we
know that there exists $\alpha$ such that $x_2 \, I \, (\alpha, 0, 1-\alpha)$. Similarly, we know
that $x_1 \, I \, (1,0,0)$ and $x_3 \, I \, (0,0,1)$. Therefore, let $u_1 = 1$, $u_2 = \alpha$, and
$u_3 = 0$. Now consider any lottery $p = (p_1, p_2, p_3)$. From this lottery,
we can form the compound lottery $(p_1 + u_2 p_2, 0, p_3 + p_2 (1 - u_2))$ by
substituting the lottery $(u_2, 0, 1 - u_2)$ for the degenerate lottery that
reaches $x_2$. Using Lemma 3.2, we know that the agent must be indifferent
between this compound lottery and $p$. Since we can make similar
substitutions for $x_1$ and $x_3$, the agent is indifferent between $p$ and
$(p_1 u_1 + p_2 u_2 + p_3 u_3, 0, p_1 (1 - u_1) + p_2 (1 - u_2) + p_3 (1 - u_3))$. Now consider
an alternative lottery $q$. By replicating the above arguments,
we know that the agent is indifferent between $q$ and
$(q_1 u_1 + q_2 u_2 + q_3 u_3, 0, q_1 (1 - u_1) + q_2 (1 - u_2) + q_3 (1 - u_3))$.

For the grand finale, the application of Lemma 3.5 says that $p \, R \, q$ if
and only if $p_1 u_1 + p_2 u_2 + p_3 u_3 \geq q_1 u_1 + q_2 u_2 + q_3 u_3$. Thus, we can
represent preferences over lottery $p$ by the scalar $p_1 u_1 + p_2 u_2 + p_3 u_3$.
Note that the theorem does not say that any $\alpha \in [0,1]$ will work.
Rather the theorem ensures that there exists at least one such $\alpha$ so
that the outcome utilities $u_1 = 1$, $u_2 = \alpha$ and $u_3 = 0$ will work.


1.1. Cardinal Utility. In the previous chapter, we assuaged utility
skeptics with the argument “relax, utility functions do nothing more
than represent preference orderings.” Once we move into the world of
expected utilities, however, such a defense is no longer tenable. The
utility functions over outcomes $u(x_j)$ are no longer simply ordinal, but
cardinal in that they contain information about relative preferences
over outcomes. Just as the Fahrenheit temperature scale allows us to
say that the difference between 212° and 32° is twice the difference
between 122° and 32°, cardinal utility functions allow us to say that
“my preference for steak over chicken is 3.8 times my preference for
chicken over fish.” Thus, unlike the case of ordinal utilities, the value
$u(x_i) - u(x_j)$ has a meaningful interpretation.

It is easy to see why expected utility theory depends on cardinal
utility functions. Suppose that an agent were choosing between two
lotteries over the three outcomes $x_1, x_2, x_3$ with $x_1 \, P \, x_2 \, P \, x_3$. Lottery 1
provides a .5 shot at $x_1$ and a .5 shot at $x_3$ while lottery 2 gives $x_2$ with
certainty. Suppose we had an expected utility representation $u(x_1) = 1$,
$u(x_2) = \alpha \in (0,1)$, and $u(x_3) = 0$ which predicted that the agent
would choose lottery 1 (so that $\alpha < .5$). If this representation were simply
ordinal, we could apply any order preserving transformation to the utility
function, and the resulting function would represent the exact same preferences.
But consider the following transformation of the utilities: $v(x_1) = 1$,
$v(x_2) = 1 - \alpha$, and $v(x_3) = 0$. This transformation preserves the
preference ordering, but now the agent would prefer lottery 2.

However, just as Fahrenheit is not the only temperature scale which
produces identical relative information about heat, expected utility
representations are not unique. To see this, consider two cardinal utility
functions $u(x_j)$ and $v(x_j) = a + b \, u(x_j)$. Given a lottery $p$, these
produce expected utility functions of $\sum_j p_j u(x_j)$ and $a + b \sum_j p_j u(x_j)$
respectively. Since the ordering of $\sum_j p_j u(x_j)$ and $\sum_j q_j u(x_j)$ will
be the same as the ordering of $a + b \sum_j p_j u(x_j)$ and $a + b \sum_j q_j u(x_j)$
as long as $b > 0$, each cardinal utility function produces exactly the
same behavior. This also implies that $u(x_i) - u(x_j)$ is unique up to
a scale factor $b$ while relative differences $\frac{u(x_i) - u(x_j)}{u(x_k) - u(x_l)}$ are uniquely
determined.
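
The contrast between affine and merely order preserving transformations can be seen in a few lines of code. This is an illustrative sketch only: the value 0.3 for the utility of the middle outcome and the constants $a = 2$, $b = 5$ are hypothetical choices, and `eu` and `prefers_first` are names introduced here.

```python
def eu(lottery, u):
    """Expected utility of a lottery (list of probabilities) under utilities u."""
    return sum(p_j * u_j for p_j, u_j in zip(lottery, u))

def prefers_first(l1, l2, u):
    """True if the first lottery has strictly higher expected utility."""
    return eu(l1, u) > eu(l2, u)

lottery1 = [0.5, 0.0, 0.5]   # .5 shot at x1, .5 shot at x3
lottery2 = [0.0, 1.0, 0.0]   # x2 with certainty

alpha = 0.3                                  # hypothetical u(x2), below .5
u = [1.0, alpha, 0.0]
v_ordinal = [1.0, 1 - alpha, 0.0]            # same ordering, not affine in u
v_affine = [2.0 + 5.0 * u_j for u_j in u]    # a + b*u with a = 2, b = 5 > 0

print(prefers_first(lottery1, lottery2, u))          # choice under u
print(prefers_first(lottery1, lottery2, v_ordinal))  # choice flips
print(prefers_first(lottery1, lottery2, v_affine))   # choice is preserved
```

Only the affine transformation leaves the predicted choice unchanged, which is the sense in which the representation is cardinal.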


2. Risk Preferences 


One aspect of choice under uncertainty that is not pinned down by 
the axioms of the previous section is the set of risks that a rational 
agent is willing to tolerate. Some agents may be willing to accept a 
substantial probability of a bad outcome in exchange for moderately 
higher probabilities of good outcomes, while others will seek to min- 
imize the probability of bad outcomes by forgoing opportunities for 
high payoffs. Recall that the continuity axiom says that given three
outcomes $x \, P \, y \, P \, z$, an agent will prefer a lottery between $x$ and $z$ to a
certainty of $y$ if the probability of $z$ is sufficiently small. However, the
axiom is silent about how small this risk needs to be.

The way we characterize an agent’s preference for or toleration of risk
is whether the agent is willing to accept a fair bet. A fair bet is one
that pays its stake (or price) in expectation. We will suppose for now
that the risky stakes and rewards (i.e. outcomes) are denominated in
money or some other commodity which agents tend to prefer more of.
In later sections, we will extend these definitions to outcome spaces
where agents have satiable preferences. Let $w$ be the stake, let $x_1 > x_2$
be distinct monetary outcomes, and let $p$ be the probability of $x_1$ and $1-p$
the probability of $x_2$. Then we have the following definitions:


DEFINITION 3.1. A bet is fair if $w = p x_1 + (1-p) x_2$. A bet is
favorable if $w < p x_1 + (1-p) x_2$. A bet is unfair if $w > p x_1 + (1-p) x_2$.


For example, a fair bet would be buying a $1 lottery ticket that
pays $100 with probability 1/100 and nothing with probability 99/100. The
bet would be favorable if the ticket cost less than a dollar and unfair
if it cost more. Needless to say, all lotteries that are called “lotteries”
in the real world (especially those run by state governments) are of
the unfair variety. Using the notion of fair bets, we can characterize
preferences for risk.


DEFINITION 3.2. An agent is risk averse if she will not accept any
fair bets, or $u(p x_1 + (1-p) x_2) > p \, u(x_1) + (1-p) \, u(x_2)$.


DEFINITION 3.3. An agent is risk acceptant if she will accept some
unfair bet, or $u(p x_1 + (1-p) x_2) < p \, u(x_1) + (1-p) \, u(x_2)$.


DEFINITION 3.4. An agent is risk neutral if she is indifferent between
any fair bet and its stake, or $u(p x_1 + (1-p) x_2) = p \, u(x_1) + (1-p) \, u(x_2)$.


It turns out that an agent’s preference for risk is closely related to
the shape of her utility function for money. Consider Figure 3.4 which
demonstrates the utility comparison for the fair bet $w = p x_1 + (1-p) x_2$.
Note that the line connecting the coordinates $(x_1, u(x_1))$ and
$(x_2, u(x_2))$ must travel through the point

$$(p x_1 + (1-p) x_2, \; p \, u(x_1) + (1-p) \, u(x_2)).$$

Thus, we know that the value of $p \, u(x_1) + (1-p) \, u(x_2)$ lies at the
intersection of the line between $u(x_1)$ and $u(x_2)$ and the vertical line
beginning at $w$. Thus, we can see that $u(w) > p \, u(x_1) + (1-p) \, u(x_2)$ so
that the agent is risk averse and rejects the fair bet. Obviously, the
property of the utility function responsible is the fact that the utility
function always lies above any line connecting two utilities. This is
the property of concavity.
Insert Figure 3.4 Here 

In Figure 3.5, we can see that the utility function always lies below
lines connecting two utility values. For a convex utility function such
as this one, the agent will always accept the fair bet as
$p \, u(x_1) + (1-p) \, u(x_2) > u(p x_1 + (1-p) x_2)$. Figure 3.6 illustrates that linear utility
functions produce risk-neutral behavior as they imply that the expected
utility of a gamble is identical to the utility of the expected outcome.
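
The link between curvature and risk attitude can be illustrated with a short calculation. The sketch below is only an illustration under stated assumptions: it compares $u(w)$ with the expected utility of a single fair bet (it does not verify concavity globally), the three utility functions are hypothetical examples, and `risk_attitude` is a name introduced here.

```python
import math

def risk_attitude(u, x1, x2, p, tol=1e-12):
    """Compare u(w) with EU for the fair bet paying x1 w.p. p and x2 otherwise."""
    stake = p * x1 + (1 - p) * x2                  # fair stake w
    utility_of_stake = u(stake)                    # u(p*x1 + (1-p)*x2)
    expected_utility = p * u(x1) + (1 - p) * u(x2)
    if utility_of_stake > expected_utility + tol:
        return "risk averse"
    if utility_of_stake < expected_utility - tol:
        return "risk acceptant"
    return "risk neutral"

# Hypothetical utility functions over money.
print(risk_attitude(math.sqrt, 100, 0, 0.5))            # concave
print(risk_attitude(lambda x: x ** 2, 100, 0, 0.5))     # convex
print(risk_attitude(lambda x: 3 * x + 1, 100, 0, 0.5))  # linear
```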

Insert Figures 3.5 and 3.6 Here 
2.1. Risk Preferences and Stochastic Dominance*. It is not
conceptually difficult to extend these ideas to lotteries that assign
positive probability to an arbitrary (finite) number of possible outcomes.


DEFINITION 3.5. An agent (or its preference relation) exhibits risk
aversion if for any non-deterministic lottery $p$, $u(\sum_j p_j x_j) > \sum_j p_j u(x_j)$.


DEFINITION 3.6. An agent (or its preference relation) exhibits risk
acceptance if for any non-deterministic lottery $p$, $u(\sum_j p_j x_j) < \sum_j p_j u(x_j)$.


To relate the notion of risk aversion to potential behavior, we need
to define a few additional concepts. Given a lottery $p$ the expected
value is $E(p) = \sum_j p_j x_j$ and the variance of the lottery is
$V(p) = \sum_j p_j (E(p) - x_j)^2$. For a given lottery $p$ and expected utility
representation $u = (u_1, \dots, u_J)$, the certainty equivalent, $C(p)$, is the amount
of money that the agent values as much as the lottery. That is, we
have
$$u(C(p)) = EU(p).$$

The behavior of risk averse individuals is somewhat predictable. They
are willing to sacrifice some expected value for reductions in variance so
that the certainty equivalent is less than the expected value. In addition,
if we consider a lottery $p$ and a lottery $q$ with the same expected
value as $p$ but a greater variance, then the agent will prefer $p$.
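
As a concrete illustration, the expected value, variance, and certainty equivalent of a simple lottery can be computed directly. This is a minimal sketch under stated assumptions: the lottery (100 or 0 with equal probability) and the square-root utility are hypothetical, the function names are our own, and the certainty equivalent is obtained from a known inverse of $u$ rather than by a numerical solver.

```python
import math

def expected_value(probs, outcomes):
    """E(p) = sum_j p_j x_j."""
    return sum(p * x for p, x in zip(probs, outcomes))

def variance(probs, outcomes):
    """V(p) = sum_j p_j (E(p) - x_j)^2."""
    mean = expected_value(probs, outcomes)
    return sum(p * (mean - x) ** 2 for p, x in zip(probs, outcomes))

def certainty_equivalent(probs, outcomes, u, u_inverse):
    """Solve u(C) = EU(p) for C using the inverse of u."""
    eu = sum(p * u(x) for p, x in zip(probs, outcomes))
    return u_inverse(eu)

# Hypothetical lottery and a concave (risk averse) utility function.
probs, outcomes = [0.5, 0.5], [100.0, 0.0]
ev = expected_value(probs, outcomes)
var = variance(probs, outcomes)
ce = certainty_equivalent(probs, outcomes, math.sqrt, lambda y: y ** 2)
print(ev, var, ce)
```

For this agent the certainty equivalent lies below the expected value, so she would give up some expected value to avoid the variance.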


DEFINITION 3.7. We say that $q$ is a mean preserving spread of $p$
if $q$ is a compound lottery that first takes the realization of $p$ and then
adds to it a random term $\varepsilon$ with distribution $z$ having $E(z) = 0$ and
$V(z) > 0$.


THEOREM 3.2. Given a preference relation R on lotteries and the 
Bernoulli utility function u(x) that is used to represent R, the following 
statements are equivalent 

1. The preference relation R exhibits risk aversion 

2. The utility function u(x) is strictly concave 

3. For any lottery $p$, $C(p) \leq E(p)$ (and if $V(p) > 0$ the inequality
is strict).

4. For any two lotteries q and p in which q is a mean preserving 
spread of p we have EU(q) < EU(p) 


PROOF. The equivalence between (1) and (2) is immediate. To
show that (2) implies (3) assume that $R$ exhibits risk aversion. This
implies that for any non-deterministic lottery $p$ (so that $V(p) > 0$), we
have $u(E(p)) > \sum_j p_j u(x_j)$. But since $u(C(p)) = EU(p)$ it must be
the case that $u(E(p)) > u(C(p))$. Since $u(x)$ is an increasing function
this implies that $E(p) > C(p)$.

To see that (3) implies (2), assume that (3) is true and consider
any two outcomes $x_1$ and $x_2$ with $p \in (0,1)$. In this case (3) and the
fact that $u(x)$ is increasing imply that $u(p x_1 + (1-p) x_2) > p \, u(x_1) +
(1-p) \, u(x_2)$ and thus the function $u(x)$ is concave. Since $V(p) = 0$
and $p$ deterministic result in equality this case is trivial.

To see that (2) implies (4) consider $p$ and a mean preserving spread
$q$ that results in $x + \varepsilon$ with $x$ having the distribution $p$ and
$\varepsilon \in \{\varepsilon_1, \dots, \varepsilon_T\}$ having the distribution $z$. We have

$$EU(q) = \sum_j \sum_t z_t \, p_j \, u(x_j + \varepsilon_t).$$

By (2), for each value of $x_j$ we have

$$\sum_t z_t \, p_j \, u(x_j + \varepsilon_t) < p_j \, u\Big(\sum_t z_t (x_j + \varepsilon_t)\Big).$$

Rearranging the right hand side yields

$$\sum_t z_t \, p_j \, u(x_j + \varepsilon_t) < p_j \, u\Big(x_j + \sum_t z_t \varepsilon_t\Big) = p_j \, u(x_j).$$

Summing over $j$ yields

$$EU(q) = \sum_j \sum_t z_t \, p_j \, u(x_j + \varepsilon_t) < \sum_j p_j \, u(x_j) = EU(p).$$


To see that (4) implies (3) consider a lottery $q$. Note that the lottery $q$
is a mean preserving spread of the lottery $p$ which assigns probability
1 to $E(q)$. Thus (4) implies that $EU(p) > EU(q)$. Since $p$ is deterministic,
$u(E(p)) = EU(p)$ so we have $u(E(p)) > EU(q)$. Since $u(C(q)) = EU(q)$,
monotonicity of $u(x)$ implies that $E(p) = E(q) > C(q)$. □
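
The logic of the proof can be traced on a small example. The sketch below builds a mean preserving spread in the sense of Definition 3.7 and compares expected utilities under a concave utility function; the lottery, the noise distribution, and the square-root utility are hypothetical choices, and the helper names are our own.

```python
from collections import defaultdict
from fractions import Fraction as F
import math

def spread(p, noise):
    """Compound lottery: draw x from p, then add an independent zero-mean term."""
    assert sum(z * e for e, z in noise.items()) == 0, "noise must have mean zero"
    q = defaultdict(F)                      # missing outcomes start at probability 0
    for x, p_x in p.items():
        for e, z in noise.items():
            q[x + e] += p_x * z
    return dict(q)

def mean(lottery):
    return sum(prob * x for x, prob in lottery.items())

def eu(lottery, u):
    return sum(float(prob) * u(x) for x, prob in lottery.items())

# Hypothetical monetary lottery {outcome: probability} and noise term.
p = {16: F(1, 2), 64: F(1, 2)}
noise = {-15: F(1, 2), 15: F(1, 2)}         # E(z) = 0, V(z) > 0
q = spread(p, noise)

print(mean(p), mean(q))                     # the means coincide
print(eu(p, math.sqrt), eu(q, math.sqrt))   # the concave agent prefers p
```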


If lottery $q$ is a mean preserving spread of $p$, lottery $q$ is said to be
second order stochastically dominated by $p$. One result, which can be
proven by reapplying the logic of the last proof, gives a convenient
characterization of second order stochastic dominance.


THEOREM 3.3. Lottery $q$ is second order stochastically dominated
by $p$ if and only if for any concave increasing function $u(x)$,

$$\sum_j p_j u(x_j) \geq \sum_j q_j u(x_j).$$


Thus, risk averse individuals have preferences that conform with
second order stochastic dominance: if $p$ second order stochastically
dominates $q$, a risk averse agent will prefer $p$ to $q$. Now in some cases
choice over lotteries is trivial. One common notion of “no brainer”
decision problems involves choice between a lottery and another that
is said to first-order stochastically dominate it.


DEFINITION 3.8. Lottery $q$ is first order stochastically dominated
by $p$ if for any non-decreasing function $u(x)$

$$\sum_j p_j u(x_j) \geq \sum_j q_j u(x_j).$$


So in choosing between lotteries which are ordered by first order 
stochastic dominance, risk attitudes are irrelevant. 
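
In practice first order stochastic dominance is usually checked through cumulative distribution functions rather than by testing every non-decreasing utility function. The sketch below relies on the standard equivalent characterization (which the text does not state): $p$ dominates $q$ when the probability that $p$ pays $x$ or less never exceeds the corresponding probability under $q$. The two lotteries are hypothetical and the function names are our own.

```python
import math

def cdf(lottery, x):
    """Probability that the lottery pays x or less."""
    return sum(prob for outcome, prob in lottery.items() if outcome <= x)

def first_order_dominates(p, q, tol=1e-12):
    """True if p's CDF lies weakly below q's CDF at every outcome."""
    support = sorted(set(p) | set(q))
    return all(cdf(p, x) <= cdf(q, x) + tol for x in support)

def eu(lottery, u):
    return sum(prob * u(outcome) for outcome, prob in lottery.items())

# Hypothetical lotteries {outcome: probability}; p shifts mass toward 100.
p = {0: 0.2, 50: 0.3, 100: 0.5}
q = {0: 0.4, 50: 0.3, 100: 0.3}

print(first_order_dominates(p, q), first_order_dominates(q, p))
# Spot check with concave, linear, and convex increasing utilities.
for u in (math.sqrt, lambda x: x, lambda x: x ** 2):
    print(eu(p, u) >= eu(q, u))
```

The spot check covers a risk averse, a risk neutral, and a risk acceptant agent, all of whom rank $p$ above $q$.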
2.2. Risk Preferences with Satiable Preferences. As we discussed
in Chapter 2, many of the utility functions used in political
science are satiable in that agents have most preferred outcomes.
Assuming such preferences, however, entails implicit assumptions about
risk. Consider the function in Figure 3.7 and a lottery over some $x_1$
less than the agent’s ideal point and $x_2$ greater than the ideal point.
As above, the expected utility of such a lottery is the intersection of
the line between $u(x_1)$ and $u(x_2)$ and the vertical line beginning at $w$.
However, note that since the ideal point lies between $x_1$ and $x_2$, there
must be at least one outcome $w_1$ in this interval such that $u(w_1) >
p \, u(x_1) + (1-p) \, u(x_2)$. Thus, satiable preferences must produce risk
averse behavior at least in regions near the ideal point. Satiable
preferences need not produce global risk aversion, however. Gambles over
a set of outcomes bounded away from the ideal point for which the
agent’s utility function is convex will produce risk acceptant behavior.
The next section elaborates these points in more detail.


Figure 3.7 


2.3. Risk and Higher Dimension Euclidean preferences. In
this subsection we consider choice over lotteries on $\mathbb{R}^n$. In the case
of preferences that are Euclidean, Bendor and Meirowitz (2004) have
shown that we can extend the notions of risk aversion from the case of
strictly increasing preferences. Recall that preferences over $x \in \mathbb{R}^n$
are Euclidean if they are representable by a utility function of the form

$$u(x) = h(-\|x - x^*\|)$$

where $x^*$ is a point in $\mathbb{R}^n$ and $h : \mathbb{R}^1 \to \mathbb{R}^1$ is a strictly increasing
function. If the function $h$ is strictly concave then it is not difficult
to see that the utility function $u(x)$ is itself strictly concave. In this
case the Bernoulli utility function will represent risk averse preferences.
However, even if the function $h$ is not concave, the preferences exhibit
a form of risk aversion. To extend the concept of a mean-preserving
spread to $\mathbb{R}^n$, we simply apply Definition 3.7 with the relevant states
in $\mathbb{R}^n$.


THEOREM 3.4. If $u(x)$ is Euclidean then for any two lotteries $q$
and $p$ on $\mathbb{R}^n$ with expected value $x^*$ in which $q$ is a mean preserving
spread of $p$ we have $EU(q) < EU(p)$.


PROOF. Assume $q$ and $p$ are lotteries on $\mathbb{R}^n$ with expected value
$x^*$ in which $q$ is a mean preserving spread of $p$. Define $d_j = \|x_j - x^*\|$
to be the real variable that measures the distance between the random
variable $x$ and the point $x^*$. Now since preferences are Euclidean we
have $EU(q) < EU(p)$ if and only if

$$\sum_j q_j \, h(-\|x_j - x^*\|) < \sum_j p_j \, h(-\|x_j - x^*\|)$$

for some increasing function $h(\cdot)$. This condition is satisfied if and
only if

$$\sum_j q_j \, h(-d_j) < \sum_j p_j \, h(-d_j).$$

Since $q$ is a mean preserving spread of $p$, it must be the case that

$$\sum_j q_j d_j > \sum_j p_j d_j.$$

But this means for any increasing function $g(\cdot)$ including $h(\cdot)$ we have

$$\sum_j q_j \, g(-d_j) < \sum_j p_j \, g(-d_j).$$ □


Informally, for lotteries that are centered at the agent’s ideal point 
second order stochastic dominance in outcomes corresponds to first 
order stochastic dominance in terms of disutility, and thus risk attitudes 
are not directly relevant to assessing how agents will compare lotteries. 
Alternatively put, all Euclidean agents are risk averse over lotteries 
that are centered at their ideal point. 
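
A small numerical example in $\mathbb{R}^2$ may help fix ideas. The sketch below is an illustration for one particular pair of lotteries, not a proof: the ideal point, the lotteries, and the three increasing functions $h$ are hypothetical, and in this example every outcome of $q$ lies farther from the ideal point than every outcome of $p$. The name `euclidean_eu` is introduced here.

```python
import math

def euclidean_eu(lottery, ideal, h):
    """EU under u(x) = h(-||x - x*||) for a lottery {point: probability}."""
    return sum(prob * h(-math.dist(point, ideal)) for point, prob in lottery.items())

ideal = (0.0, 0.0)                                   # hypothetical ideal point x*
p = {(1.0, 0.0): 0.5, (-1.0, 0.0): 0.5}              # centered at x*
# q adds (0, +1) or (0, -1) with equal probability to each realization of p.
q = {(1.0, 1.0): 0.25, (1.0, -1.0): 0.25,
     (-1.0, 1.0): 0.25, (-1.0, -1.0): 0.25}

increasing_h = {
    "convex": math.exp,
    "linear": lambda t: t,
    "concave": lambda t: -(t ** 2),   # increasing on the relevant domain t <= 0
}
results = {name: euclidean_eu(q, ideal, h) < euclidean_eu(p, ideal, h)
           for name, h in increasing_h.items()}
print(results)
```

In this example the agent prefers $p$ to its spread $q$ whether $h$ is convex, linear, or concave.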


3. Learning 


Since an agent’s beliefs about the probabilities of various outcomes
are the key to decision-making under uncertainty, it is important to
analyze how a rational agent should respond to new information about
the likelihood of various outcomes. Once again it is prudent to begin
with an example. Consider Figure 3.8 where an agent believes that
the incumbent politician is “good” with probability $\frac{3}{4}$ and “bad” with
probability $\frac{1}{4}$. Suppose however that she could incorporate information
about the incumbent’s performance in office such as the inflation rate
in the economy. How would this change her probability assessment of
the incumbent’s quality?

Insert Figure 3.8 Here 

First, let’s assume that the agent knows that good incumbents are
more likely than bad incumbents to produce low inflation. In this
example, we suppose that the agent knows that good incumbents produce
low inflation with probability $\frac{2}{3}$ and that bad incumbents produce low
inflation with only a $\frac{1}{5}$ probability. The first thing that our intuition
tells us is that when inflation is low the agent should raise her
probability assessment that the incumbent is good above her original
belief of $\frac{3}{4}$. Conversely, when inflation is high, the agent should lower
the probability that the incumbent is good. Fortunately, we can take
the analysis a step further and compute the exact probabilities that
should be assigned to a good incumbent after either realization of the
inflation rate.

First, consider the case where inflation is low. A rational agent
should know that the outcome is either the top node or the third node
of the second panel of Figure 3.8. Further, she knows that
there is a $\frac{3}{4} \cdot \frac{2}{3} = \frac{1}{2}$ probability of reaching the top node and a
$\frac{1}{4} \cdot \frac{1}{5} = \frac{1}{20}$ probability of reaching
the third. Therefore, after observing low inflation, it is 10 times as
likely that the incumbent is good than that he is bad. Let $p(l)$ be the
probability of a good incumbent conditional on low inflation. Since
probabilities must sum to one, $p(l) + \frac{p(l)}{10} = 1$ so that $p(l) = \frac{10}{11}$. We can
use similar reasoning to show that $p(h) = \frac{5}{9}$. Note the confirmation
of our intuition that a realization of low inflation should raise the
probability that the incumbent is good while high inflation lowers it.

To generalize this example, we will need to be a bit more precise
about the underlying probability theory. Let $A$ and $B$ represent two
events (such as the terminal nodes in Figure 3.8). Suppose we know
that event $B$ has occurred and wish to compute the probability that
event $A$ occurs. This is known as the conditional probability of $A$
given event $B$. We denote it as follows:

$$\Pr(A \mid B) = \frac{\Pr(A \,\&\, B)}{\Pr(B)}$$

assuming $\Pr(B) > 0$, where $\Pr(A)$ is the probability of event $A$, $\Pr(B)$ is
the probability of event $B$, and $\Pr(A \,\&\, B)$ is the probability that both
events occur (known as the joint probability).

This formula, which is often termed Bayes’ law, is obtained from the
identity $\Pr(A \,\&\, B) = \Pr(A \mid B) \Pr(B)$ by dividing both sides by $\Pr(B)$, an
operation which is permissible as long as this value is non-zero. As a
special case, independent events have the property that
$\Pr(A \,\&\, B) = \Pr(A) \Pr(B)$ so that

$$\Pr(A \mid B) = \frac{\Pr(A) \Pr(B)}{\Pr(B)} = \Pr(A).$$

To see this rule in action, note that the probability of low inflation and
a good incumbent ($\frac{1}{2}$) is the probability of low inflation conditional on
a good incumbent ($\frac{2}{3}$) times the probability of a good incumbent ($\frac{3}{4}$).
Given these definitions, we can state the main result.

THEOREM 3.5. (Bayes’ Law) Let $A_1, \dots, A_N$ be disjoint events
(i.e. no two can occur simultaneously) such that $\sum_{n=1}^{N} \Pr(A_n) = 1$ and
$\Pr(A_n) > 0$ for all $n$. Let $B$ be some other event (which may occur
concurrently with any $A_n$). Then

$$\Pr(A_n \mid B) = \frac{\Pr(B \mid A_n) \Pr(A_n)}{\sum_{m=1}^{N} \Pr(B \mid A_m) \Pr(A_m)}.$$

Bayes’ Law gives us an easy to use formula to compute how rational
agents should update their probability assessments following new
information. Note that we can easily apply it to the voter’s problem
from above. Let $A_1$ be the event that the incumbent is good and $A_2$
be the event that she is bad. Since the incumbent cannot be both good
and bad, these events satisfy the requirement of disjointness. Event
$B$ is low inflation. For two events the formulas are:

$$\Pr(A_1 \mid B) = \frac{\Pr(B \mid A_1) \Pr(A_1)}{\Pr(B \mid A_1) \Pr(A_1) + \Pr(B \mid A_2) \Pr(A_2)}$$

and

$$\Pr(A_2 \mid B) = \frac{\Pr(B \mid A_2) \Pr(A_2)}{\Pr(B \mid A_1) \Pr(A_1) + \Pr(B \mid A_2) \Pr(A_2)}.$$

We can obtain all of the following probabilities from Figure 3.8:

$$\Pr(A_1) = \tfrac{3}{4}, \quad \Pr(A_2) = \tfrac{1}{4}, \quad \Pr(B \mid A_1) = \tfrac{2}{3}, \quad \Pr(B \mid A_2) = \tfrac{1}{5}.$$

Thus, we can plug these numbers into Bayes’ Law to get:

$$\Pr(A_1 \mid B) = \frac{\frac{2}{3} \cdot \frac{3}{4}}{\frac{2}{3} \cdot \frac{3}{4} + \frac{1}{5} \cdot \frac{1}{4}} = \frac{10}{11}$$

and

$$\Pr(A_2 \mid B) = \frac{\frac{1}{5} \cdot \frac{1}{4}}{\frac{2}{3} \cdot \frac{3}{4} + \frac{1}{5} \cdot \frac{1}{4}} = \frac{1}{11}.$$

Voilà!
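
The calculation is mechanical enough to hand to a computer. The sketch below applies Bayes’ Law to the voter’s problem, taking the prior on a good incumbent to be 3/4 and the probabilities of low inflation to be 2/3 for a good incumbent and 1/5 for a bad one, which is how we read the example; treat these as assumptions if your copy of Figure 3.8 differs. The function name `posterior` is our own.

```python
from fractions import Fraction as F

def posterior(priors, likelihoods):
    """Bayes' law: Pr(A_n | B) for disjoint, exhaustive events A_1, ..., A_N."""
    joint = [lik * prior for lik, prior in zip(likelihoods, priors)]
    total = sum(joint)                       # Pr(B)
    if total == 0:
        raise ValueError("Pr(B) must be positive")
    return [j / total for j in joint]

priors = [F(3, 4), F(1, 4)]                  # Pr(good), Pr(bad)
low_given = [F(2, 3), F(1, 5)]               # Pr(low inflation | good), Pr(low | bad)
high_given = [1 - x for x in low_given]      # Pr(high inflation | type)

after_low = posterior(priors, low_given)
after_high = posterior(priors, high_given)
print(after_low)    # beliefs (good, bad) after low inflation
print(after_high)   # beliefs (good, bad) after high inflation
```

Low inflation moves the belief that the incumbent is good above the prior of 3/4, and high inflation moves it below.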


While seemingly straightforward and logical, the application of Bayes’
Law is often criticized as a poor model of learning. Not only can it be
computationally challenging and exceed the typical individual’s grasp
of conditional probability, it can also produce counter-intuitive
predictions. Consider the following scenario from the Let’s Make a Deal
game show hosted by Monty Hall. Monty offers contestants a choice
among three doors. Behind one door is a luxury car while the
other doors hide prizes of little pecuniary value (goats seem to have
been a favorite). Once a door is selected but before it is opened,
Monty opens one of the remaining two doors to reveal a goat. He
then asks the contestant if he would like to switch his selection to the
remaining closed door. Should the rational contestant switch? Most
people would intuitively say there is nothing to gain from switching on
the grounds that getting the car from a subsequent switch is just as
likely as getting it on the original try. The probability of winning the
car is $\frac{1}{2}$ either way. Indeed a number of mathematicians and statisticians
took this position in response to the publication of this problem
in a popular newspaper column. However, this logic is incompatible
with Bayes’ law.

To simplify, suppose the contestant chooses door 3. Since the doors
are ex ante the same, the analysis of the other cases is identical. First,
consider the probability of winning if the contestant does not switch
to the remaining door. Obviously, this is the same as the original
probability that there is a car behind door 3, or $\frac{1}{3}$. However, now
consider the probability of winning by switching. To formalize, let $A_1$,
$A_2$, $A_3$ correspond to the car being located behind doors 1, 2, and 3
respectively. Let $B_1$, $B_2$ correspond to the event that Monty opens
door 1 or 2. Since we assume that $\Pr(A_1) = \Pr(A_2) = \Pr(A_3) = \frac{1}{3}$, we
simply need to compute $\Pr(B_i \mid A_j)$ for all of the events. Since Monty
will never expose a car, $\Pr(B_1 \mid A_1) = \Pr(B_2 \mid A_2) = 0$. We also assume
that in the event $A_3$ Monty randomly selects which goat to expose.
Therefore, $\Pr(B_1 \mid A_2) = \Pr(B_2 \mid A_1) = 1$ and
$\Pr(B_1 \mid A_3) = \Pr(B_2 \mid A_3) = \frac{1}{2}$.

Suppose Monty opens door 2. Then the probability that a switching
contestant wins is equal to

$$\Pr(A_1 \mid B_2) = \frac{\Pr(B_2 \mid A_1) \Pr(A_1)}{\Pr(B_2 \mid A_1) \Pr(A_1) + \Pr(B_2 \mid A_2) \Pr(A_2) + \Pr(B_2 \mid A_3) \Pr(A_3)}
= \frac{1 \cdot \frac{1}{3}}{1 \cdot \frac{1}{3} + 0 \cdot \frac{1}{3} + \frac{1}{2} \cdot \frac{1}{3}} = \frac{2}{3}.$$

Similarly, if Monty opens door 1, the probability of winning is
$\Pr(A_2 \mid B_1) = \frac{2}{3}$. So a switching contestant wins with probability $\frac{2}{3}$ whereas a
sticking one only wins $\frac{1}{3}$ of the time.⁶
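
Readers who distrust the algebra can enumerate the game directly. The following sketch computes the exact winning probabilities for the two strategies under the assumptions used above: the car is equally likely to be behind each door, Monty never reveals the car or the contestant’s door, and he randomizes evenly when he has a choice. The function name `monty_hall` is our own.

```python
from fractions import Fraction as F

def monty_hall(pick=3):
    """Exact win probabilities (stick, switch) for a contestant who picks `pick`."""
    doors = (1, 2, 3)
    stick = switch = F(0)
    for car in doors:
        # Doors Monty may open: not the contestant's pick and not the car.
        openable = [d for d in doors if d != pick and d != car]
        for opened in openable:
            prob = F(1, 3) * F(1, len(openable))     # Pr(car) * Pr(opened | car)
            other = next(d for d in doors if d not in (pick, opened))
            if car == pick:
                stick += prob
            if car == other:
                switch += prob
    return stick, switch

stick, switch = monty_hall(pick=3)
print(stick, switch)
```

Because the doors are ex ante identical, the result does not depend on which door the contestant picks.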


⁶ The solution we present to this problem is somewhat convoluted in order to
provide an additional demonstration of Bayes’ Rule. An easier proof is to note

So why does the intuition that switching doesn’t pay fail us so
badly? The reason is that most people do not appreciate the
implication of the fact that Monty will never reveal a car. Thus, observing
that he doesn’t open a particular door is information that a switcher
can use in his decision that a stand-patter cannot.

While the Monty Hall problem does expose a critical set of problems
with Bayesian learning, such objections can be carried too far. Bayes’
Law does tell us correctly that switchers will win $\frac{2}{3}$ of the time. Thus,
a frequent viewer of the show can learn that one should switch even
without ever doing a conditional probability calculation. So one can
justify the use of Bayes’ rule by appealing to the notion that agents are
acting as if they had performed the calculation even if they are simply
following rules that they have learned from experience.


4. Critiques of Expected Utility Theory


While most of the models used in this book will rely heavily on 
expected utility theory, it is worth pointing out that there is a large 
and influential body of work critical of expected utility theory. How- 
ever, the application of these critical insights and alternative models to 
political game theory is still in its infancy.⁷


4.1. Risk, Uncertainty, and Subjective Probability. The econ- 
omist Frank Knight argued that expected utility theory is a model of 
risk rather than uncertainty. He defines uncertainty as the situation 
where individuals lack sufficient statistical information to form esti- 
mates of the probabilities of various outcomes. In other words, in a 
situation of uncertainty, individuals do not know the true set of lotteries 
P. However, the statistician Leonard Savage responded that expected 
utility theory could be resuscitated by assuming that individuals have 
subjective beliefs about P which can be used to formulate probability 
distributions over outcomes. 

However, Daniel Ellsberg formulated the following paradox which 
cast doubt over whether uncertainty was reducible to beliefs about 
beliefs. Suppose that there are two urns containing red and black 
balls. In urn 1, there are 100 red and black balls where the proportion 
of red balls is unknown. Urn 2, however, contains 50 red balls and 50 
black balls. 


that a switcher only loses if he picked the right door in the first place. Thus, a
switcher loses 1/3 of the time and wins 2/3.
⁷ “Behavioral” models (as opposed to those based on expected utility theory) have
become far more common in economics in recent years (see Camerer 200x).

Now suppose that subjects are given $100 for selecting a red ball.
Most subjects choose to select from urn 2. But when offered $100
for selecting a black ball, the modal choice is again urn 2. However,
choosing urn 2 for both gambles violates the axioms of expected utility
theory. According to expected utility theory, choosing urn 2 in search
of a red ball indicates a belief that urn 1 has fewer than 50 red balls
while selecting urn 2 for a black ball suggests that the subject believes
that urn 1 has fewer than 50 black balls. Obviously, these beliefs are
inconsistent with the knowledge that urn 1 contains 100 balls. Selecting
urn 1 in both gambles similarly violates expected utility theory.
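
A short computation makes the inconsistency concrete. In the sketch below, `p_red_urn1` is a hypothetical subjective probability of drawing red from urn 1. Since the prize is the same in every gamble, an expected utility maximizer simply compares winning probabilities. The grid of beliefs and the function name are our own illustrative choices.

```python
def prefers_urn2(p_red_urn1, color):
    """True if urn 2 is strictly preferred for a bet on `color`.

    Urn 2 wins with probability 1/2 for either color; urn 1 wins with the
    subjective probability that the named color is drawn from it.
    """
    p_win_urn1 = p_red_urn1 if color == "red" else 1 - p_red_urn1
    return 0.5 > p_win_urn1

# Candidate subjective beliefs about urn 1: 0.00, 0.01, ..., 1.00.
grid = [k / 100 for k in range(101)]
rationalizing = [p for p in grid
                 if prefers_urn2(p, "red") and prefers_urn2(p, "black")]
print(rationalizing)
```

The list of rationalizing beliefs is empty: preferring urn 2 for red requires a belief below 1/2, while preferring it for black requires a belief above 1/2.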


4.2. The Allais Paradox. The predictions of expected utility 
theory have been tested in a number of experimental settings. These 
studies have provided robust evidence for a number of decision-making 
anomalies inconsistent with expected utility theory. One of the ear- 
liest and most studied anomalies was first uncovered by the French 
economist Maurice Allais. This anomaly is based on the finding that 
subjects often make choices inconsistent with the independence axiom. 

Initially, subjects are asked to choose between lotteries a and b 
where: 

Lottery a: .33 chance of $2500, .66 chance of $2400, and .01 chance 
of 0 

Lottery b: $2400 for sure 

When given these choices, subjects overwhelmingly choose lottery 
b. For example, Kahneman and Tversky (1979) find that 82% choose 
lottery b when given this hypothetical choice. 

Next the subjects are given the choice between lotteries c and d. 

Lottery c: .33 chance of $2500, .67 chance of 0 

Lottery d: .34 chance of $2400 and .66 chance of 0 

Experimental subjects generally choose c. Kahneman and Tversky 
find that 83% choose this lottery. However, it can easily be shown 
that choosing b in the first experiment and c in the second violates the 
independence axiom and therefore expected utility theory. First, note 
that the choices of b and c imply that 


u(2400) > .33u(2500) + .66u(2400) + .01u(0) 
and 
.33u(2500) + .67u(0) > .34u(2400) + .66u(0) 


Rearranging the first inequality gives .34u(2400) > .33u(2500) + .01u(0),
while rearranging the second gives .33u(2500) + .01u(0) > .34u(2400).
Thus, we derive a contradiction. To see that the contradiction is
attributable to a violation of the independence axiom, note that
lottery a can be written as the compound lottery
.34(33/34, 0, 1/34) + .66(0, 1, 0) over the outcomes (2500, 2400, 0) while
b is .34(0, 1, 0) + .66(0, 1, 0). Thus, if b P a, then the independence
axiom holds that (0, 1, 0) P (33/34, 0, 1/34). But this in turn implies
that .34(0, 1, 0) + .66(0, 0, 1) P .34(33/34, 0, 1/34) + .66(0, 0, 1), which
means that d P c, contrary to the modal choice of c over d.
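
The contradiction can also be verified by brute force. The following sketch searches over randomly drawn increasing utility assignments for one that ranks b over a and c over d at the same time. The utility values are hypothetical integers drawn only for illustration, and exact fractions are used so that rounding cannot produce a spurious match.

```python
import random
from fractions import Fraction

def eu(lottery, u):
    """Expected utility of a lottery given as {prize: probability}."""
    return sum(prob * u[prize] for prize, prob in lottery.items())

# The four lotteries from the text, with exact probabilities.
a = {2500: Fraction(33, 100), 2400: Fraction(66, 100), 0: Fraction(1, 100)}
b = {2400: Fraction(1)}
c = {2500: Fraction(33, 100), 0: Fraction(67, 100)}
d = {2400: Fraction(34, 100), 0: Fraction(66, 100)}

def modal_pattern_found(trials=5000, seed=0):
    """Look for increasing utilities under which b beats a AND c beats d."""
    rng = random.Random(seed)
    for _ in range(trials):
        u0, u2400, u2500 = sorted(rng.sample(range(10_000), 3))
        u = {0: u0, 2400: u2400, 2500: u2500}
        if eu(b, u) > eu(a, u) and eu(c, u) > eu(d, u):
            return True
    return False

print(modal_pattern_found())  # the algebra in the text implies this is False
```

The search must fail because the utility gap between b and a is exactly the negative of the gap between c and d, so the two strict preferences can never hold together.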


4.3. Prospect Theory. In their classic article, Kahneman and 
Tversky (1979) propose an alternative model of decision-making to ac- 
count for the Allais paradox and other experimental anomalies. Whereas 
many previous authors attributed the Allais paradox to a preference 
for certainty, Kahneman and Tversky note that the independence ax- 
iom is often violated when all of the lotteries are far from sure things. 
Consider the following pairs of lotteries. 

Lottery a: .45 chance of $6000, .55 chance of 0 

Lottery b: .90 chance of $3000, .10 chance of 0 

Lottery c: .001 chance of $6000, .999 chance of 0 

Lottery d: .002 chance of $3000 and .998 chance of 0 

They find that the modal choices were b over a and c over d, choices
which violate the independence axiom. Since lotteries c and d have
minuscule probabilities, subjects seem inclined to go for the one with
the bigger prize. However, when both probabilities are reasonably
high, subjects are still inclined to take the one that is relatively more
certain.

Kahneman and Tversky note, however, that this preference for cer- 
tainty does not hold when gambles are over losses rather than gains. 
Consider the following pairs of lotteries: 

Lottery a: .80 chance of -$4000, .20 chance of 0

Lottery b: -$3000 for sure

Lottery c: .20 chance of -$4000, .80 chance of 0

Lottery d: .25 chance of -$3000, .75 chance of 0

If the Allais paradox were simply due to preferences for certainty,
the modal choices would again be for b and c. However, Kahneman
and Tversky find that a and d are the modal choices. Their
interpretation is that while individuals are risk averse over gains, they are
risk acceptant over losses.

Finally, Kahneman and Tversky argue that the presentation of the 
lotteries can affect the choices that individuals make. Suppose that 
an individual has been given $1000 and then offered: 

Lottery a: .5 chance of an additional $1000 

Lottery b: $500 for sure

Next consider an individual who has been given $2000 and offered
the choice of:

Lottery c: .5 chance of losing $1000 

Lottery d: Lose $500 for sure 

Kahneman and Tversky find that b and c are the modal choices 
even though expected utility theory holds that a and c are identical 
lotteries as are b and d. 

To account for these anomalies, Kahneman and Tversky propose 
prospect theory as an alternative to expected utility theory. Accord- 
ing to their model, choice involves two distinct phases: editing and 
evaluation. In the editing phase, individuals “organize and reformu- 
late the options so as to simplify subsequent evaluation and choice.” 

4.3.1. The Editing Phase. Kahneman and Tversky identify six dis- 
tinct operations that occur during the editing phase. 


(1) Coding: Kahneman and Tversky argue that individuals
evaluate gains and losses separately. Thus, the first stage
of editing involves determining a reference point and coding
outcomes as either gains or losses.

(2) Combination: Individuals combine probabilities associated
with identical outcomes.

(3) Segregation: Individuals identify and segregate the riskless
components of a choice. For example, a lottery that produces
$200 with probability .7 and $100 with probability .3 is interpreted
as a riskless $100 gain and a lottery over an additional $100.

(4) Cancellation: When comparing two lotteries, individuals
ignore the common elements of both lotteries. For example, the
$2000 bonus in the last example “cancels out” and does not
affect the choice between c and d.

(5) Simplification: Individuals may simplify the tasks by rounding 
probabilities such as recoding .49 to even odds or by dropping 
extremely unlikely outcomes from consideration. 

(6) Detection of dominance: Individuals drop from consideration 
any lottery that is first-order stochastically dominated. 


4.3.2. The Evaluation Phase. Kahneman and Tversky’s model of
evaluation is very similar in form to expected utility theory in that both
models postulate that individuals evaluate gambles using a weighted
average of the payoffs to the outcomes. However, in Kahneman and
Tversky’s model the weights used are not the subjective probabilities
of the outcomes but rather functions of the probabilities. They also
argue, contra expected utility theory, that the outcome value functions
should treat gains and losses asymmetrically.

Let x and y be two distinct monetary outcomes where p is the
probability of x and q is the probability of y. With probability \(1 - p - q\),
nothing happens or the payoff is 0. Kahneman and Tversky define
prospects as strictly positive if \(x, y > 0\) and \(p + q = 1\), strictly negative
if \(x, y < 0\) and \(p + q = 1\), and regular in all other cases. For a regular
prospect, individuals are assumed to maximize

\[
V(x, p; y, q) = \pi(p) v(x) + \pi(q) v(y)
\]

where \(v(x)\) and \(v(y)\) are the values of each outcome and \(\pi(p)\) and \(\pi(q)\)
are weights based on the outcome probabilities. They assume that
\(v(0) = 0\), \(\pi(0) = 0\), and \(\pi(1) = 1\). Note this function would be an
expected utility function if v were a Bernoulli function and \(\pi(p) = p\)
for all p.

For strictly positive or strictly negative prospects such as \(x > y > 0\)
or \(x < y < 0\) where \(p + q = 1\), individuals maximize

\[
V(x, p; y, q) = v(y) + \pi(p) [v(x) - v(y)].
\]

This functional form captures the idea that individuals evaluate such
lotteries as a risk-free component \(v(y)\) plus a risky component
\(v(x) - v(y)\).

A key assumption of Prospect Theory is that \(v(\cdot)\) is asymmetric
with respect to gains and losses. Kahneman and Tversky make three
specific assumptions.


(1) The value function is defined in terms of deviations from a 
reference point (no gains or losses). 

(2) The value function is concave for gains and convex for losses. 

(3) The value function is steeper for losses than for gains. 


Figure 3.9 illustrates a function satisfying these properties. 
Insert Figure 3.9 Here 


Additionally, Kahneman and Tversky make several assumptions about
the form of the decision weights \(\pi(p)\).


(1) \(\pi\) is an increasing function of p.

(2) \(\pi(0) = 0\).

(3) \(\pi(1) = 1\).

(4) For small values of p, \(\pi(p) > p\).

(5) For small values of p, \(\pi\) is subadditive, i.e. \(\pi(rp) > r\pi(p)\) for
\(0 < r < 1\).

(6) For all \(0 < p < 1\), \(\pi\) satisfies the property of subcertainty, i.e.
\(\pi(p) + \pi(1 - p) < 1\).

(7) For all \(0 < p, q, r < 1\), \(\pi\) is subproportional, i.e.
\(\pi(pq)/\pi(p) \le \pi(pqr)/\pi(pr)\).

The first three assumptions are straightforward. The fourth is
simply the idea that individuals over-weight small probabilities.
Subadditivity, which helps resolve the Allais paradox, implies in conjunction
with Assumption 2 that \(\pi\) is convex for small values of p (see exercises 1
and 2). Subcertainty also helps resolve the Allais paradox. Recall that
the modal choices require that \(v(2400) > \pi(.33)v(2500) + \pi(.66)v(2400)\)
and \(\pi(.33)v(2500) > \pi(.34)v(2400)\). These two inequalities require
that \(1 > \pi(.66) + \pi(.34)\). Finally, subproportionality accounts for many
of the violations of the independence axiom since it implies that for a
fixed ratio of probabilities, the ratio of decision weights will be closer to
unity when the probabilities are low.
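
To see how decision weights can reproduce the modal Allais choices, consider the following numerical sketch. The functional forms and parameter values for `weight` and `value` are assumptions borrowed from later work by Tversky and Kahneman (1992). They are not given in the text and are not claimed to satisfy every assumption listed above; they serve only as one concrete illustration.

```python
def weight(p, gamma=0.61):
    """Hypothetical decision weight pi(p) (an assumed form, not one from the text)."""
    return p**gamma / (p**gamma + (1 - p) ** gamma) ** (1 / gamma)

def value(x, alpha=0.88, lam=2.25):
    """Hypothetical value function: concave for gains, convex and steeper for losses."""
    return x**alpha if x >= 0 else -lam * (-x) ** alpha

def V(prospect):
    """Value of a prospect listed as (outcome, probability) pairs.

    Outcomes of 0 are omitted because v(0) = 0.
    """
    return sum(weight(p) * value(x) for x, p in prospect)

a = [(2500, 0.33), (2400, 0.66)]  # the remaining .01 pays 0
b = [(2400, 1.0)]
c = [(2500, 0.33)]
d = [(2400, 0.34)]

print(V(b) > V(a), V(c) > V(d))            # the modal pattern: b over a, c over d
print(weight(0.66) + weight(0.34) < 1)     # subcertainty at these probabilities
```

Under these assumed forms the sure thing b is valued above a while c is valued above d, the pattern that expected utility theory cannot accommodate.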

A function satisfying these assumptions is plotted in Figure 3.10. 


Insert Figure 3.10 Here 


5. Time Preferences 


In many of the dynamic models considered in this book, we need
to characterize how individuals evaluate payoffs they receive now as
opposed to those they receive in the future. We typically assume that
individuals weight current utility more than future utility (if for no
other reason, we could die tomorrow). We use the idea of the discount
factor to capture this intuition. Let \(0 < \delta < 1\) be the relative weight that
players put on utilities one period in the future. Utilities two periods
in the future are weighted by \(\delta^2\) and so on such that utilities t periods
in the future are discounted by \(\delta^t\).


5.1. Computing Payoff Streams. Often we model games in
which there is no determinate end date as infinite games where the
number of periods goes to \(\infty\). Clearly, in an infinite game, we can no
longer simply add up the payoffs from each period in order to
determine the utility from a sequence of actions. Fortunately, geometric
discounting helps to facilitate these calculations.

We consider the easiest case first. Assume that an agent gets a
payoff of \(u_t = u\) over an infinite number of periods. There are two
ways to calculate the value function \(v^\infty\) of this stream of utilities.

Method 1: Note that we can write \(v^\infty\) as

\[
v^\infty = u + \delta u + \delta^2 u + \ldots = u \sum_{t=0}^{\infty} \delta^t.
\]

Since \(0 < \delta < 1\), \(\sum_{t=0}^{\infty} \delta^t\) is a convergent power series. It is a well known
result that \(\sum_{t=0}^{\infty} \delta^t\) converges to \(\frac{1}{1-\delta}\) so that \(v^\infty = \frac{u}{1-\delta}\). We can easily
derive the following facts about this particular power series:

(1) \(\sum_{t=T+1}^{\infty} \delta^t = \frac{\delta^{T+1}}{1-\delta}\)

(2) \(\sum_{t=1}^{\infty} \delta^t = \frac{\delta}{1-\delta}\)

(3) \(\sum_{t=0}^{T} \delta^t = \frac{1-\delta^{T+1}}{1-\delta}\)

Therefore, we can compute finite streams of utility easily as well.
For example, the value of receiving u in periods 0 through T is
\(\sum_{t=0}^{T} \delta^t u = \frac{u(1-\delta^{T+1})}{1-\delta}\).

Method 2: Another way to derive \(v^\infty\) is using Bellman’s principle
of optimality. Since \(v^\infty\) is an infinite stream of utilities, we should be
able to write it as a one period utility u plus the discounted value of
an infinite stream of utility beginning one period hence. Therefore,

\[
v^\infty = u + \delta v^\infty
\]

so that \(v^\infty = \frac{u}{1-\delta}\). We can compute finite streams using this method
as well. Again assume that the agent receives payoff u in periods 0 through T
and we wish to compute \(v^T\). We know that \(v^T = v^\infty - \delta^{T+1} v^\infty\) so
that \(v^T = \frac{u(1-\delta^{T+1})}{1-\delta}\).
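
Both methods are easy to check numerically. The sketch below compares the closed forms with a direct summation of discounted payoffs. The function names and the illustrative values u = 2 and δ = 0.9 are our own, and the finite stream is taken to run over periods 0 through T.

```python
def infinite_value(u, delta):
    """Closed form for a constant payoff u received forever: u / (1 - delta)."""
    return u / (1 - delta)

def finite_value(u, delta, T):
    """Closed form for a constant payoff u received in periods 0, ..., T."""
    return u * (1 - delta ** (T + 1)) / (1 - delta)

def brute_force(u, delta, T):
    """Add up the discounted payoffs period by period."""
    return sum(delta**t * u for t in range(T + 1))

u, delta = 2.0, 0.9
v_inf = infinite_value(u, delta)
print(v_inf, u + delta * v_inf)                              # Bellman identity
print(finite_value(u, delta, 10), brute_force(u, delta, 10))  # finite stream
```

The first printed pair confirms the Bellman identity, and the second confirms that the finite-stream formula agrees with period-by-period summation.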

While the advantages of this method are small in the simple
example of a constant stream of utility, they can be substantial in more
complex settings. Now assume that there are n states of the world
\((s_1, \ldots, s_n)\). In state \(s_i\), the agent receives \(u_i\). We assume that
the state evolves according to a Markov process such that
\(\Pr(S_t = s_j \mid S_{t-1} = s_i) = \pi_{ij}\).

Now suppose we want to compute the value \(v_j\) of the stream of
utilities beginning from state j. Using Bellman’s principle, it is easy
to see that

\[
v_j = u_j + \delta \sum_{i=1}^{n} \pi_{ji} v_i.
\]

This creates a linear system of n equations and n unknowns (the \(v_j\)’s).
Sometimes it will be easier to solve such a system by replacing one of
the equations with the requirement that \(\sum_{i=1}^{n} \pi_{ji} = 1\) for all j.

Let’s consider an easy example. Suppose we wanted to compute
the long term payoff to a political party that values holding a particular
office at \(u_1\) per period and gets a payoff of \(u_2\) in periods in which it does
not hold the office. Suppose that there is an incumbent party effect
so that if it holds office it wins with probability \(p > 1/2\) and remains in
office (state 1). However, this also implies that when it is out of office
(state 2), it will remain out of office in the next period with probability
p. With probability \(1 - p\), it transitions states either from office to
out of office or vice versa. To compute the party’s payoffs from being
in states 1 or 2, we can set up the relevant Bellman equations. Note
that \(n = 2\), \(\pi_{11} = \pi_{22} = p\), \(\pi_{12} = \pi_{21} = 1 - p\), and \(u_1 > u_2\). Thus, the
Bellman equations are

\[
v_1 = u_1 + \delta(p v_1 + (1 - p) v_2)
\]
\[
v_2 = u_2 + \delta((1 - p) v_1 + p v_2)
\]

It is straightforward to derive

\[
v_1 = \frac{(1 - \delta p) u_1 + \delta(1 - p) u_2}{1 - 2\delta p + \delta^2(2p - 1)}
\]
\[
v_2 = \frac{\delta(1 - p) u_1 + (1 - \delta p) u_2}{1 - 2\delta p + \delta^2(2p - 1)}
\]
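
The closed-form solution can be checked against a direct numerical solution of the two Bellman equations. The sketch below iterates the equations to their fixed point and compares the result with the formulas above. The parameter values u₁ = 1, u₂ = 0, p = 0.7, and δ = 0.9 are hypothetical, and repeated substitution is simply one convenient way to solve the linear system.

```python
def closed_form(u1, u2, p, delta):
    """Party values in office (v1) and out of office (v2) from the derived formulas."""
    denom = 1 - 2 * delta * p + delta**2 * (2 * p - 1)
    v1 = ((1 - delta * p) * u1 + delta * (1 - p) * u2) / denom
    v2 = (delta * (1 - p) * u1 + (1 - delta * p) * u2) / denom
    return v1, v2

def iterate_bellman(u1, u2, p, delta, sweeps=2000):
    """Solve the Bellman equations by repeated substitution starting from zero."""
    v1 = v2 = 0.0
    for _ in range(sweeps):
        v1, v2 = (u1 + delta * (p * v1 + (1 - p) * v2),
                  u2 + delta * ((1 - p) * v1 + p * v2))
    return v1, v2

print(closed_form(1.0, 0.0, 0.7, 0.9))
print(iterate_bellman(1.0, 0.0, 0.7, 0.9))
```

Because δ < 1, each sweep shrinks the remaining error by a factor of δ, so the iteration converges to the same values as the closed form.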
In these examples, we have taken the utility streams as exogenous
(either fixed or based on a fixed set of probabilities). This can be relaxed
significantly. Suppose that the agent chooses \(x_t \in X(s_t)\) in every time
period to maximize the discounted value of the stream \(u(x_t, s_t)\) where
\(s_t \in (s_1, \ldots, s_n)\) is the state of the world in time t. We may also allow
the probability distribution of transitions from \(s_t\) to depend on \(x_t\) so let
\(\pi(s_{t+1} \mid x_t, s_t)\) be the probability of observing some state \(s_{t+1}\) following
state \(s_t\) and choice \(x_t\). We will only consider stationary plans, i.e. those
in which the prescription depends only on the state. Let \(x(s)\) be a
stationary plan specifying the action taken when the state is s.

We can then characterize the payoffs to implementing plan \(x(s)\) in
state s as

\[
v(x(s), s) = u(x(s), s) + \delta \sum_{s'} v(x(s'), s') \pi(s' \mid x(s), s).
\]

Assuming that we can solve for \(v(x(s), s)\) for all plans, we can compute
the optimal one as

\[
v^*(s) = \sup_{x(s)} v(x(s), s).
\]

Bellman’s principle of optimality is that

\[
v^*(s) = \sup_{x \in X(s)} \left[ u(x, s) + \delta \sum_{s'} v^*(s') \pi(s' \mid x, s) \right].
\]
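
When the sets of states and actions are finite, the supremum is a maximum and Bellman's equation can be solved by value iteration. The sketch below is a minimal illustration under that assumption. The example is hypothetical and not taken from the text: a party that is in or out of office can pay a cost of 0.2 to campaign, which raises its chance of holding office next period from 0.5 to 0.8.

```python
def value_iteration(states, actions, u, pi, delta, tol=1e-10):
    """Iterate v(s) <- max_x [u(x, s) + delta * sum_s' v(s') pi(s' | x, s)]."""
    v = {s: 0.0 for s in states}
    while True:
        new_v, plan = {}, {}
        for s in states:
            scores = {
                x: u(x, s) + delta * sum(pi(s2, x, s) * v[s2] for s2 in states)
                for x in actions(s)
            }
            plan[s] = max(scores, key=scores.get)   # best action in state s
            new_v[s] = scores[plan[s]]
        if max(abs(new_v[s] - v[s]) for s in states) < tol:
            return new_v, plan
        v = new_v

states = ["in", "out"]

def actions(s):
    return ["campaign", "rest"]

def u(x, s):
    # Office is worth 1 per period; campaigning costs 0.2 (hypothetical values).
    return (1.0 if s == "in" else 0.0) - (0.2 if x == "campaign" else 0.0)

def pi(s_next, x, s):
    # Probability of holding office next period depends only on the action.
    p_in = 0.8 if x == "campaign" else 0.5
    return p_in if s_next == "in" else 1 - p_in

v_star, plan = value_iteration(states, actions, u, pi, delta=0.9)
print(v_star, plan)
```

The function returns both the optimal values and a stationary plan, that is, one action for each state.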


5.2. Hyperbolic Discounting. While most of the literature on
repeated games uses the model of constant discounting, there is a
growing literature in behavioral economics on alternatives more consistent
with experimental evidence.⁸ The most widely studied alternative is
hyperbolic discounting which assumes that at time 0 agents discount
the utility at time t by

\[
h(t) = (1 + \alpha t)^{-\gamma/\alpha}
\]

for \(\gamma > 0\) and \(\alpha > 0\). Unless \(\alpha\) is close to zero, hyperbolic discounting
weighs the future much more heavily than constant discounting. It
also implies that agents have a “time consistency” problem. What
they consider the optimal plan for time t depends on how far time t is in
the future. Suppose that an agent has to decide how to allocate $1
of consumption over three periods 0, 1, 2. Assume that \(U(x) = \sqrt{x}\).
Using constant discounting the optimal plan solves

\[
\max \sqrt{x_0} + \delta \sqrt{x_1} + \delta^2 \sqrt{x_2}
\quad \text{such that} \quad \sum_{t=0}^{2} x_t = 1.
\]

The solution must satisfy \(x_1 = \delta^2 x_0\) and \(x_2 = \delta^4 x_0\). Substituting into
the budget constraint, we get that

\[
x_0 = \frac{1}{1 + \delta^2 + \delta^4}, \qquad
x_1 = \frac{\delta^2}{1 + \delta^2 + \delta^4}, \qquad
x_2 = \frac{\delta^4}{1 + \delta^2 + \delta^4}.
\]

⁸ The reader should review optimization in the mathematical appendix before
proceeding to this section.

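Now consider what would happen if the agent re-optimized after
consuming \(x_0 = \frac{1}{1 + \delta^2 + \delta^4}\) in the first period. She would again optimally
choose \(x_2 = \delta^2 x_1\). Substituting this into the constraint
\(x_1 + x_2 = \frac{\delta^2 + \delta^4}{1 + \delta^2 + \delta^4}\) we get

\[
x_1 = \frac{1}{1 + \delta^2} \cdot \frac{\delta^2 + \delta^4}{1 + \delta^2 + \delta^4} = \frac{\delta^2}{1 + \delta^2 + \delta^4}, \qquad
x_2 = \frac{\delta^4}{1 + \delta^2 + \delta^4}.
\]

Thus, she will wish to continue with her optimal consumption plan by
consuming exactly as much in periods 1 and 2 as she had forecast.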

Now consider the same allocation problem when the agent uses
hyperbolic discounting. To keep the algebra simple, let \(\alpha = \gamma = 1\).
Thus, the agent solves

\[
\max \sqrt{x_0} + \frac{1}{2}\sqrt{x_1} + \frac{1}{3}\sqrt{x_2}
\quad \text{such that} \quad \sum_{t=0}^{2} x_t = 1.
\]

The first order conditions for the optimum are \(x_1 = \frac{1}{4} x_0\) and \(x_2 = \frac{1}{9} x_0\).
Therefore, the solution is

\[
x_0 = \frac{36}{49}, \qquad x_1 = \frac{9}{49}, \qquad x_2 = \frac{4}{49}.
\]

Again consider what happens if the agent re-optimizes after consuming
\(x_0\). Now the first order condition is \(x_2 = \frac{1}{4} x_1\). Substituting the
constraint that \(x_1 + x_2 = \frac{13}{49}\), we find that

\[
x_1 = \frac{52}{245} > \frac{9}{49}, \qquad x_2 = \frac{13}{245} < \frac{4}{49}.
\]

Thus, the agent will wish to change her optimal plan and shift more
consumption to period 1. The reason for this anomaly is that the
relative weight of period 1 to period 2 consumption is higher in period
1 than it was in period 0.
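
The contrast between the two discounting models can be reproduced with exact arithmetic. The sketch below uses the fact that, with square-root utility, the first order conditions make consumption in each period proportional to the square of that period's weight. The function names and the value δ = 9/10 for the constant discounter are our own illustrative choices.

```python
from fractions import Fraction

def optimal_plan(weights, budget):
    """Maximize sum_t w_t * sqrt(x_t) subject to sum_t x_t = budget.

    The first order conditions make x_t proportional to w_t squared.
    """
    total = sum(w**2 for w in weights)
    return [budget * w**2 / total for w in weights]

def hyperbolic(t):
    """Weight h(t) = 1 / (1 + t), the case alpha = gamma = 1."""
    return Fraction(1, 1 + t)

def constant(t, delta=Fraction(9, 10)):
    """Weight delta**t; delta = 9/10 is an arbitrary illustrative value."""
    return delta**t

def plans(weight):
    """Return the plan made at time 0 and the plan re-optimized at time 1."""
    at_0 = optimal_plan([weight(0), weight(1), weight(2)], Fraction(1))
    at_1 = optimal_plan([weight(0), weight(1)], 1 - at_0[0])
    return at_0, at_1

hyp_0, hyp_1 = plans(hyperbolic)
con_0, con_1 = plans(constant)
print(hyp_0, hyp_1)
print(con_0[1:] == con_1)
```

The constant discounter's revised plan coincides with her original plan for periods 1 and 2, while the hyperbolic discounter's revised plan moves consumption toward period 1.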

While hyperbolic discounting has been useful in explaining
experimental anomalies and temporal patterns in consumption (retirees
consume less than a constant discounting model would predict), the
applications in political science have been few.⁹


6. Exercises 


EXERCISE 3.1. Let Smith be a member of the House of Representa- 
tives. Smith is trying to decide whether or not to run for the Senate. 
He believes that he has a 50% chance of winning his party’s nomina- 
tion, and if he gets the nomination he has a 40% chance of winning 
the seat. Suppose that his utility from the Senate seat is W while his 
utility of losing, returning home, and running his family used car lot is 
L. His utility of keeping his House seat is H. 


(1) Using a lottery tree, describe the lottery involved with running 
for the Senate. 
(2) Compute the expected utility of running for the Senate. 


⁹ One conceptual obstacle is that utilities over infinite horizons may not be well
defined. Suppose that an agent evaluated an infinite stream of constant utilities
u. Evaluation requires that the series \(\sum_{t=0}^{\infty} h(t) u\) converge. However, this will not
be the case for a large set of \(\alpha\) and \(\gamma\).




(3) How low must H be relative to W and L before Smith will
decide to run for the Senate?


EXERCISE 3.2. Prove Theorem 8.1. 


EXERCISE 3.3. Compute the expected payoff of the following lottery. 
There are 5 periods. In each period, the agent flips a coin and receives 
one dollar for each consecutive period for which she has obtained heads 
i.e. if she has received heads x consecutive times, she receives $x. 


EXERCISE 3.4. Suppose that instead of always revealing a goat, 
Monte Hall randomly selects a door to open and thus occasionally re- 
veals the car. Clearly, a contestant should switch to the open door if 
the car is revealed, but should she switch to the closed door if a goat is 
revealed? 


EXERCISE 3.5. Suppose that a country is fighting in a war. In each
period, it costs \(f > 0\) to fight a battle. The country wins each battle with
probability \(\pi\). The country wins the war and receives a payoff of \(w > 0\)
forever if it wins two consecutive battles. If it loses two consecutive
battles, it loses and receives \(l = 0\) forever. The country discounts future
periods by \(\delta\).

There are five states corresponding to the consecutive wins and
losses in battle. Two of these are terminal states corresponding to
victory or loss of the overall war. For each of the non-terminal states,
compute the expected utility of continuing the war. Find a condition
for f in terms of \(\pi\), w, l for which the country chooses not to start the
war. Find a condition for the country to surrender after losing one
battle.


EXERCISE 3.6. Prove that \(\pi(p) > p\) and sub-additivity imply that
the decision weight function \(\pi\) is convex for small values of p.


EXERCISE 3.7. Consider the following pairs of lotteries:
Lottery a: .45 chance of $6000, .55 chance of 0 

Lottery b: .90 chance of $3000, .10 chance of 0 

Lottery c: .001 chance of $6000, .999 chance of 0 
Lottery d: .002 chance of $3000 and .998 chance of 0 
Which choices are predicted by Prospect Theory? Why? 


CHAPTER 4 


Social Choice Theory 


1. The Open Search 


In the pages that follow we consider a scenario that many
readers of this book may soon encounter in their professional lives (if they
have not already): the “open” faculty search. Consider a fictional
political science department whose membership is spread evenly across
five sub-fields: American (A), comparative (C), international relations
(I), theory (T), and formal theory/methods (F). This year the
fictional university is having a mediocre year financially so the dean only
gives the department authorization for one additional hire. This dean,
unwilling to alienate any of the department’s various factions, does
not specify which field the department should hire in, but tells the
department “since you study politics you should be able to settle this fairly.”
Those readers who have experienced a similar situation in their own
departments should be smiling knowingly at the dean’s folly.

Members of each sub-field have homogeneous preferences as to which
field the new hire should come from. Indeed, each field has its own
complete and transitive ordering over the field of the potential hire. These
rankings are given by Table 4.1:


Table 4.1 | 


T 
T 
I 

C 
P 
A 


So the department chair sets out to figure out how the department
should decide. The first idea she entertains is to have the department
vote based on plurality rule. Each member of the department is to cast
a ballot for their favorite field and the one with the most votes wins.
However, the chair quickly determines that the election would generate
a five-way tie so she abandons that idea. Next she considers pair-wise
majority voting. Under this procedure, each field will be paired against
each other field. If any field wins all of the pair-wise comparisons, the
department would hire in that field. Sure that this is a fair way to
decide, she implements this procedure in the next department meeting.
The meeting begins with a vote between A and C. Field C wins with
support from T and I. However, in the vote between C and I, I wins
3-2, and then T beats I. While T survives the vote with F, it loses to
C. Thus, every field is defeated in at least one pairwise vote. The
chair’s procedure has failed to bring about any resolution. However,
the chair does note that A loses in every pairwise vote and F loses to all
fields except A. So at least she concludes that neither an Americanist
nor a formal theorist should be hired.

Frustrated, the chair decides that a scoring system such as the one
used to rank college football teams might do the trick. Undeterred
by previous failures, she proposes that each department member rank
each field. A top ranking will give the field 5 points, a second ranking
4 points, and so on. If everyone voted according to his preferences,
the chair calculated that the ranking would be C (17 points), T (16
points), I (15 points), F (14 points), and A (13 points). However,
before the vote could take place, a theorist citing an obscure 16th
century philosopher claimed it was inappropriate to weigh fourth and
fifth rankings so heavily. He suggested that the vote be based solely
on ranking the top three. Such a procedure would guarantee that the
outcome was a tie between C and T. Nevertheless, not feeling that the
application of the dead philosopher’s theory was appropriate in this
circumstance, the chair moved forward with the original procedure.

However, she was taken aback by the results. The formal theorists,
sensing the opportunity to be strategic, cast their ballots with T in the
first position and C in the fifth position. This resulted in 18 points
for T and only 16 for C, an outcome preferred by F. Infuriated by the
perceived duplicity, the C’s called for a revote. Their plan was to drop
T to the fifth position on their ballots and win 16 to 15. However, the
chair quickly realized that T would simply drop C to the bottom in
retaliation which might even lead to I winning if they also cast their
ballots strategically. She quickly adjourned the meeting. The next
day she called the dean to have the line transferred to the economics
department.

That the search ended in failure is not surprising. The
fundamental result of social choice theory is that collective choice processes must
either restrict the set of alternatives or violate some desirable normative
properties. Furthermore, as we will soon see, all mechanisms for
making collective decisions are subject to strategic manipulation by agents
such as that perpetrated by F.


2. Preference Aggregation Rules 


In this section, we lay out the basic notation and ideas for the
formal analysis of preference aggregation rules. We will limit ourselves
to the case of a finite set of agents \(N = \{1, 2, \ldots, n\}\) (\(n > 2\)) who are
to choose some outcome from the set X.¹ Our primary goal is to
understand how the preferences of the individual agents map into the
collective preferences so we assume that agent i has preference ordering
\(R_i\) on X. As in chapter 2, we assume that these preference orderings
are complete and transitive. We will denote the set of all possible
complete and transitive preference orderings as \(\mathcal{R}\). We denote a list of
preference orderings for all n agents as \(\rho = \{R_1, R_2, \ldots, R_n\}\) which we
call a preference profile. The set of profiles is therefore \(\mathcal{R}^n\). By \(\mathcal{B}\) we
denote the set of complete orderings on X.


DEFINITION 4.1. A preference aggregation rule is a function
\(f : \mathcal{R}^n \to \mathcal{B}\).

A preference aggregation rule is simply a procedure that takes the
set of individual preference orderings and produces a social preference
ordering. While we use subscripted \(R_i\) to represent individual
orderings, we denote the social ordering as R. As an example, consider the
pairwise majority voting illustrated in the introduction to this chapter.
We can formally define that function such that for two alternatives x
and y, \(x R y\) if at least as many agents have \(x R_i y\) as have \(y R_i x\).
Since a complete ordering can be produced for any set of preferences,
this procedure satisfies our definition. Importantly, our definition does
not restrict the outcomes of preference aggregation rules to be transitive.
Indeed, in our fictional department pair-wise majority voting produces
T P I P C P T.
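
A cycle of this kind is easy to generate. The sketch below applies pairwise majority voting to a hypothetical three-agent profile over T, I, and C. The profile is chosen only to reproduce the cycle and is not the departmental ranking of Table 4.1; the function names are our own. Each agent's ranking is a list from most to least preferred.

```python
def majority_prefers(profile, x, y):
    """True if strictly more agents rank x above y than rank y above x."""
    x_votes = sum(ranking.index(x) < ranking.index(y) for ranking in profile)
    return x_votes > len(profile) - x_votes

def maximal_set(profile, alternatives):
    """Alternatives that are not beaten by any other under pairwise majority."""
    return [x for x in alternatives
            if not any(majority_prefers(profile, y, x)
                       for y in alternatives if y != x)]

# Hypothetical strict rankings, best alternative first.
profile = [["T", "I", "C"], ["I", "C", "T"], ["C", "T", "I"]]

print(majority_prefers(profile, "T", "I"),
      majority_prefers(profile, "I", "C"),
      majority_prefers(profile, "C", "T"))
print(maximal_set(profile, ["T", "I", "C"]))
```

Every alternative is beaten by some other alternative, so the set of unbeaten alternatives is empty.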

What properties would we like our preference aggregation rule to
satisfy? Perhaps the most important feature would be the ability to
generate a best outcome so that the agents will actually have
something to choose. In other words, we would like the social maximal
set \(M(R, X)\) to be non-empty, and we would like this to be true for
all preference profiles. However, we know from chapter 2 that this
requires that R be transitive.


¹ Throughout this chapter, we will focus only on models of complete
information so that we can speak interchangeably about choosing actions, policies, or
outcomes.

DEFINITION 4.2. A preference aggregation rule f is transitive if for
every \(\rho \in \mathcal{R}^n\) the ordering R is transitive.

Secondly, it would be nice if the rule were at least minimally
democratic so that the preferences of a single agent or dictator didn’t
completely determine the social ranking of the alternatives.


DEFINITION 4.3. A preference aggregation rule f is non-dictatorial
if there does not exist an \(i \in N\) such that for every \(\rho \in \mathcal{R}^n\) and every
\(x, y \in X\), \(x P_i y\) implies \(x P y\).


Next, we wouldn’t look too favorably at aggregation rules that
produced social rankings that all agents disagreed with. If all agents
prefer x to y, society’s preferences should reflect this ordering as
well. This criterion was first expounded by the Italian economist
Vilfredo Pareto and is often referred to as Pareto efficiency or optimality.


DEFINITION 4.4. A preference aggregation rule f is Weakly Paretian
if, for any \(x, y \in X\), if \(x P_i y\) for every \(i \in N\) then \(x P y\).


Finally, it would be nice if the social preference ordering for any
two outcomes depended only on the individual preference orderings for
those two outcomes. One of the reasons that the formal theorists
were able to manipulate the outcome of the chair’s counting procedure
is that the social ranking between C and T depended on F’s relative
preferences for F, A, and I.² From the perspective of a choice between
C and T those preferences should be irrelevant. This property is known
as the independence of irrelevant alternatives or IIA.


DEFINITION 4.5. A preference aggregation rule f is independent of
irrelevant alternatives if, for any pair of policies \(x, y \in X\) and any two
profiles \(\rho, \rho' \in \mathcal{R}^n\) with \(x R_i y\) if and only if \(x R'_i y\) for all \(i \in N\), \(x R y\) if
and only if \(x R' y\).
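
To see how a scoring rule of the kind the chair used can violate IIA, consider the following sketch. The two five-agent profiles over x, y, and z are hypothetical. Every agent ranks x against y in the same way in both profiles, and only the position of z changes. Scores here count the alternatives ranked below each option, which is a rescaling of the chair's point system rather than the exact procedure in the text.

```python
def borda_scores(profile):
    """Total score of each alternative: alternatives ranked below it, summed over agents."""
    m = len(profile[0])
    scores = {x: 0 for x in profile[0]}
    for ranking in profile:
        for position, x in enumerate(ranking):
            scores[x] += m - 1 - position
    return scores

def same_pairwise_views(p1, p2, x, y):
    """True if every agent ranks x versus y identically in both profiles."""
    return all((r1.index(x) < r1.index(y)) == (r2.index(x) < r2.index(y))
               for r1, r2 in zip(p1, p2))

# Hypothetical rankings, best first; only the placement of z differs.
profile_1 = [list("xyz")] * 3 + [list("yzx")] * 2
profile_2 = [list("xzy")] * 3 + [list("yxz")] * 2

s1, s2 = borda_scores(profile_1), borda_scores(profile_2)
print(same_pairwise_views(profile_1, profile_2, "x", "y"))
print(s1, s2)
```

The social ranking of x and y reverses between the two profiles even though no agent changes her view of x versus y, which is exactly what IIA rules out.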


These all seem like reasonable properties and each can be justified
easily on normative or practical grounds (though the case for IIA is
weaker). The properties of transitivity, Pareto optimality, and IIA can
also be justified using the fact that they are all satisfied by an
individual decision maker. Thus, these are the properties necessary so
that social decision-making is as well behaved as that of a rational
individual. Unfortunately, one of the fundamental results in all of
social science tells us that aggregation rules cannot satisfy all of
these properties simultaneously. Arrow’s Theorem says that the only
aggregation function that produces transitive preferences while
satisfying the Pareto principle and IIA is a dictator, not a happy
result unless your last name happens to be Castro. Thus, the only way
social preferences act like individual preferences is if they are in
fact individual preferences.

² This counting rule is known as the Borda count. For a formal
definition, assume each agent has complete strict preferences and let
$r_i(x) = |\{z : z P_i x\}|$ (read: the number of outcomes preferred to
$x$ by agent $i$). Then the Borda count rule is: for all $x, y \in X$,
$x P y$ if and only if $\sum_{i \in N} r_i(x) < \sum_{i \in N} r_i(y)$.
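The Borda count's failure of IIA can be checked on a small case. The following is a minimal sketch in Python, assuming strict rankings written best-first; the five-agent profiles `p` and `p_prime` are hypothetical illustrations, not the department example from the text. Every agent ranks x against y the same way in both profiles, yet the social ranking of x and y flips.

```python
def borda_scores(profile):
    """Sum over agents of r_i(x): the number of alternatives ranked above x."""
    scores = {}
    for ranking in profile:  # each ranking lists alternatives best-first
        for position, alt in enumerate(ranking):
            scores[alt] = scores.get(alt, 0) + position
    return scores


def borda_prefers(profile, x, y):
    """x P y under the Borda count: x has the strictly smaller score."""
    scores = borda_scores(profile)
    return scores[x] < scores[y]


def same_pairwise_views(p1, p2, x, y):
    """True if every agent ranks x against y identically in both profiles."""
    return all(
        (r1.index(x) < r1.index(y)) == (r2.index(x) < r2.index(y))
        for r1, r2 in zip(p1, p2)
    )


# Hypothetical profiles: only the position of z changes between them.
p = [("x", "z", "y")] * 3 + [("y", "x", "z")] * 2
p_prime = [("x", "y", "z")] * 3 + [("y", "z", "x")] * 2

print(same_pairwise_views(p, p_prime, "x", "y"))
print(borda_prefers(p, "x", "y"), borda_prefers(p_prime, "y", "x"))
```

Because only the placement of the third alternative z differs across the two profiles, the reversal is exactly the kind of dependence on an "irrelevant" alternative that IIA rules out.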

We now state and prove Arrow’s Theorem. 


THEOREM 4.1. If $X$ is finite and has at least three alternatives, then
there is no preference aggregation rule
$f : \mathcal{R}^n \to \mathcal{B}$ that is transitive, non-dictatorial,
weakly Paretian and independent of irrelevant alternatives.


We need one more definition before turning to the proof of Arrow’s
Theorem.


DEFINITION 4.6. Given a preference aggregation rule $f$, a set
$L \subset N$ is semidecisive for $x$ against $y$ if for every
$p \in \mathcal{R}^n$ with $x P_i y$ (all $i \in L$) and $y P_j x$ (all
$j \in L^c = N \setminus L$) we have $x P y$. A set $L$ is decisive for
$x$ against $y$ if for every $p \in \mathcal{R}^n$ with $x P_i y$ (all
$i \in L$) we have $x P y$. A set $L$ is decisive if for every
$x, y \in X$ it is decisive for $x$ against $y$.


A convenient proof of Arrow’s theorem rests on first establishing a 
property about decisiveness. 


LEMMA 4.1. Assume $f$ is a transitive preference aggregation rule
that is independent of irrelevant alternatives and weakly Paretian. If
$L \subset N$ is semidecisive for $x$ against $y$ for some $x, y \in X$
then $L$ is decisive.


PROOF. Assume that $L \subset N$ is semidecisive for $x$ against $y$ and
that under the profile $p \in \mathcal{R}^n$, $x P_i z$ for all
$i \in L$. Consider a profile $p' \in \mathcal{R}^n$ such that for all
$i \in L$, $x P'_i y P'_i z$, and for all $j \in L^c$, $y P'_j x$ and
$y P'_j z$ with $z \notin \{x, y\}$, and for all $i \in L^c$, $x R'_i z$
iff $x R_i z$. Since $L$ is semidecisive for $x$ against $y$, $x P' y$.
Since $f$ is weakly Paretian, $y P' z$. Since $f$ is transitive,
$x P' z$. But since the preferences of $L^c$ on $x, z$ have not been
specified in $p'$ and both $p$ and $p'$ agree on $x$ and $z$ (i.e.
$x R_i z$ iff $x R'_i z$), the fact that $f$ is IIA implies $x P z$.
Thus $L$ is decisive for $x$ against $z$. This of course implies that
$L$ is semidecisive for $x$ against $z$, and an analogous argument
demonstrates that $L$ is decisive for $x$ against $y$. We now verify
that $L$ is decisive for $y$ against $z$. Consider a profile
$p^\circ \in \mathcal{R}^n$ with $y P^\circ_i z$ for all $i \in L$ and
$p^* \in \mathcal{R}^n$ such that for all $i \in L$,
$y P^*_i x P^*_i z$, and for all $j \in L^c$, $z P^*_j x$ and
$y P^*_j x$, and for all $i \in L^c$, $y R^\circ_i z$ iff $y R^*_i z$.
Since we have already shown that $L$ is decisive for $x$ against $z$ we
have $x P^* z$. Since $f$ is weakly Paretian we have $y P^* x$. Since
$f$ is transitive we have $y P^* z$. Since the preferences of only
members of $L$ have been specified on $\{y, z\}$ by $p^*$ and both
$p^\circ$ and $p^*$ agree on $y$ and $z$, IIA implies $y P^\circ z$.
Thus $L$ is decisive for $y$ against $z$. This of course implies that
$L$ is semidecisive for $y$ against $z$. Relabeling the first step and
using this fact implies that $L$ is decisive for $y$ against $x$.
Combining these conclusions leads to the claim that $L$ is decisive. □


Thus if any group is ever semidecisive for some pairwise comparison
then the group is decisive. For preference aggregation rules
satisfying IIA and the weak Pareto criterion, if a group gets its way
once, it gets its way on all comparisons. We now complete the proof of
Arrow’s theorem by showing that either an individual is decisive or the
entire collective is not decisive. The first finding violates the
non-dictatorial condition and the second violates the weak Paretian
condition. Thus the proof demonstrates the incompatibility of Arrow’s
conditions.


PROOF OF ARROW’S THEOREM. Assume that $X$ is finite and has at least
three alternatives. By way of contradiction assume that we have a
preference aggregation rule that is transitive, non-dictatorial,
weakly Paretian and independent of irrelevant alternatives. Given the
lemma, for any set $L \subset N$ either $L$ is decisive or there is no
pair $x, y \in X$ for which $L$ is semidecisive for $x$ against $y$.
Consider two disjoint sets $A, B \subset N$ (disjoint means that
$A \cap B = \emptyset$) which are not semidecisive for any $x$ and $y$
(and thus not decisive). Let $C = N \setminus (A \cup B)$. Since
$n > 2$, and no singleton set $\{i\}$ is decisive, three such sets
$A, B, C$ exist. Now consider the profile
$\tilde{p} \in \mathcal{R}^n$ with $x \tilde{P}_i y \tilde{P}_i z$ for
$i \in A$; $z \tilde{P}_j x \tilde{P}_j y$ for $j \in B$; and
$y \tilde{P}_t z \tilde{P}_t x$ for $t \in C$. Only the members of $A$
prefer $x$ to $z$, and only the members of $B$ prefer $z$ to $y$. Since
$A$ and $B$ are not semidecisive for any pairs, we must have
$z \tilde{P} x$ and $y \tilde{P} z$. Since $f$ is transitive we must
have $y \tilde{P} x$. This implies that the set $A \cup B$ is not
semidecisive for $x$ against $y$. This means that the set is not
decisive. Thus the union of two disjoint sets which are not decisive is
also not decisive. Since $f$ is not dictatorial no singleton set is
decisive. This conclusion means that no (finite) union of individuals
is decisive. But this implies that $N$ is not decisive. This
contradicts the assumption that $f$ is weakly Paretian. Thus the result
is established. □


The introduction to this chapter provides many examples of the
implications of Arrow’s theorem. We have already pointed out that
pairwise majority voting is not transitive and that the Borda count
does not satisfy IIA. As an additional example, consider unanimity
rule, which we define as $x P y$ if and only if $x R_i y$ for all $i$
and $x P_i y$ for some $i$. Clearly, this rule satisfies the weak
Pareto criterion and it satisfies IIA since the rule operates only on
pairs of alternatives. But it is not transitive. To see this suppose
that we have the following individual preference orderings:


Table 4.2 


3 
g 
x 
y 


Clearly, we have $x R y$ and $y R z$. However, $z P x$.
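The same pattern can be reproduced mechanically. The sketch below, in Python, encodes unanimity rule as defined above and applies it to a hypothetical two-agent profile (an assumption for illustration; it is not claimed to be the profile of Table 4.2). Preferences are stored as utility numbers so that individual indifference could also be represented, and the social weak relation R is taken to hold whenever the reverse strict relation does not.

```python
def socially_strict(profile, x, y):
    """x P y under unanimity rule: all agents weakly prefer x, some strictly."""
    weakly = all(u[x] >= u[y] for u in profile)
    strictly = any(u[x] > u[y] for u in profile)
    return weakly and strictly


def socially_weak(profile, x, y):
    """x R y, taken here to mean that y is not strictly socially preferred to x."""
    return not socially_strict(profile, y, x)


# Hypothetical profile: agent 1 ranks y > z > x, agent 2 ranks z > x > y.
profile = [
    {"y": 3, "z": 2, "x": 1},
    {"z": 3, "x": 2, "y": 1},
]

print(socially_weak(profile, "x", "y"))    # x R y
print(socially_weak(profile, "y", "z"))    # y R z
print(socially_strict(profile, "z", "x"))  # z P x, so x R z fails
```

The agents disagree on x versus y and on y versus z, so neither pair is strictly ranked, but both agents place z above x.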

It is important to note that a preference aggregation rule has as
its domain the set of all possible preference profiles. Thus, Arrow’s
theorem does not rule out the possibility that there is a satisfactory
way to aggregate preferences for a given profile. One response
to Arrow’s theorem is to consider restrictions to the set of profiles
and ask whether there are preference aggregation rules that satisfy the
normative axioms on this smaller set.

One of the most common restrictions is single-peakedness.
Intuitively, single-peakedness means that there is some way of ordering
the outcomes so that each agent’s preference rankings increase up to
the most preferred outcome and then decline after that. Consider
Figure 4.1, which plots preference orderings for our fictional political
science department. Given the ordering of the outcomes ACITF, only
fields I and T have preferences with a single peak, as their preference
rank is always increasing up to their ideal outcome and declining
afterwards. The other fields have multiple-peaked preferences over the
ordering ACITF. For example, field A has peaks at A and F. The
motivated reader can verify that there is no way to order the outcomes
so that all preferences have a single peak. Thus, the preference profile
of our fictional department is not single-peaked. However, consider
the following profile:


Insert Figure 4.1 Here


Now if we order the outcomes TCIAF (or FAICT) then all
fields have a single peak at the outcome associated with their own field
as illustrated in Figure 4.2. To foreshadow our main result, consider
the outcome of pairwise majority voting. Note that now I defeats
all of the other alternatives. Furthermore, pairwise majority voting
produces the transitive ordering I P A P C P F P T, which is identical
to the preferences of I. Is it a coincidence that majority voting works
well with our new single-peaked preference profile? No, as we will see
that single-peakedness is a sufficient (but not necessary) condition for
the transitivity of majority rule.


Insert Figure 4.2 Here 


Before stating and proving this result, we need a bit more notation.
Let $q$ be an ordering function which takes the set of outcomes and
assigns each a unique rank. Formally, $q : X \to \{1, 2, \dots, |X|\}$
is a one-to-one and onto function (or bijection).³ Now we can formally
define single-peakedness.


DEFINITION 4.7. Given a set $N$ and a choice space $X$, a preference
profile $p \in \mathcal{R}^n$ is single-peaked if there exists some
bijection $q : X \to \{1, 2, \dots, |X|\}$ such that for every
$i \in N$ there is some $t_i \in X$ such that if $q(y) < q(t_i)$ then
$t_i P_i y$ (and if $q(x) < q(y) < q(t_i)$ then $t_i P_i y P_i x$) and
if $q(t_i) < q(b)$ then $t_i P_i b$ (and if $q(t_i) < q(b) < q(c)$ then
$t_i P_i b P_i c$). The set of single-peaked profiles is denoted
$S \subset \mathcal{R}^n$.


In the definition the policy $t_i$ is interpreted as $i$’s ideal
policy, and the further the rank $q(y)$ is from $q(t_i)$ the less the
agent prefers $y$. Thus, single-peakedness implies that $x P_i y$ if
$|q(t_i) - q(x)| < |q(t_i) - q(y)|$. This inequality implies that if
$q(x) > q(y)$, $x P_i y$ if and only if
$q(t_i) > \frac{q(x) + q(y)}{2}$. Conversely, if $q(y) > q(x)$,
$x P_i y$ if and only if $q(t_i) < \frac{q(x) + q(y)}{2}$.
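Definition 4.7 can be tested by brute force when the number of alternatives is small. The following Python sketch searches over every ordering of the alternatives and keeps those on which each agent's ranking rises to a single peak and falls afterwards. The function names and the two example profiles are hypothetical; the department profile from the figures is not reproduced here.

```python
from itertools import permutations


def is_single_peaked_on(order, ranking):
    """Check one agent's strict ranking (best-first) against one ordering q."""
    pos = {alt: r for r, alt in enumerate(ranking)}  # 0 = most preferred
    k = order.index(ranking[0])                      # location of the peak t_i
    rising = all(pos[order[j]] > pos[order[j + 1]] for j in range(k))
    falling = all(
        pos[order[j]] < pos[order[j + 1]] for j in range(k, len(order) - 1)
    )
    return rising and falling


def single_peaked_orderings(profile):
    """Return every ordering of the alternatives that makes the profile single-peaked."""
    alternatives = profile[0]
    return [
        order
        for order in permutations(alternatives)
        if all(is_single_peaked_on(order, ranking) for ranking in profile)
    ]


cyclic = [("a", "b", "c"), ("b", "c", "a"), ("c", "a", "b")]
peaked = [("a", "b", "c"), ("b", "a", "c"), ("c", "b", "a")]

print(single_peaked_orderings(cyclic))  # no ordering works
print(single_peaked_orderings(peaked))  # a-b-c and its mirror image
```

The search is exponential in the number of alternatives, so this is only meant for classroom-sized examples such as the five-field department.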

We can now formally state the theorem. 


THEOREM 4.2. Given $p \in S$, majority rule is transitive, weakly
Paretian, IIA and non-dictatorial.


The proof is very straightforward, but we will simplify a little bit
by assuming that there is an odd number of agents. We begin by
showing that majority rule produces the preference ordering of the
agent with the “median ideal point.” The median ideal point is defined
as $t_m$ such that $q(t_i) > q(t_m)$ for exactly $\frac{n-1}{2}$ agents
and $q(t_i) < q(t_m)$ for the remaining $\frac{n-1}{2}$ agents other
than $m$. The claim is that the social preference matches agent $m$’s
preference so that $x P y$ if and only if $x P_m y$. Let’s prove
necessity first. Suppose that $x P_m y$. If $q(x) > q(y)$, then
$q(t_m) > \frac{q(x) + q(y)}{2}$, so $m$ and all agents with
$q(t_i) > q(t_m)$ prefer $x$ to $y$. Since there are $\frac{n+1}{2}$
agents preferring $x$ to $y$, $x P y$. If $q(x) < q(y)$, then $m$ and
the $\frac{n-1}{2}$ agents with $q(t_i) < q(t_m)$ prefer $x$ so that
$x P y$. To show sufficiency, suppose that $x P y$. Since at least
$\frac{n+1}{2}$ agents must prefer $x$ to $y$, if $m$ is not among them
then at least one agent with $q(t_i) < q(t_m)$ and one agent with
$q(t_i) > q(t_m)$ prefers $x$ to $y$. Denote these agents as $l$ and
$h$ respectively. Suppose that $y P_m x$, but that $x P_l y$ and
$x P_h y$. Suppose that $q(x) > q(y)$. Then single-peakedness and
$y P_m x$ imply that $\frac{q(x) + q(y)}{2} > q(t_m)$ while $x P_l y$
and $x P_h y$ imply that $q(t_l) > \frac{q(x) + q(y)}{2}$ and
$q(t_h) > \frac{q(x) + q(y)}{2}$. This contradicts
$q(t_h) > q(t_m) > q(t_l)$. The case $q(x) < q(y)$ is symmetric.
Finally, since the social preference ordering matches that of the
median agent, it has the properties of individual preferences, which
include transitivity and IIA. The social ordering is clearly weakly
Paretian since the median’s preferences never conflict with the social
preferences. It might seem that $m$ is a dictator. However, recall that
a dictator is one whose preferences determine social preferences for
any preference profile. Since different profiles produce different
median agents, $m$ is not a dictator.

³ A function $q : X \to X$ is one-to-one if for every $y \in X$ the set
$q^{-1}(y) = \{x \in X : q(x) = y\}$ is a singleton. The function is
onto if for every $y \in X$ there is some $x \in X$ s.t. $q(x) = y$.


3. Collective Choice 


While it is useful to begin with the properties of aggregate
preference orderings, we are ultimately interested in the set of
policies that are maximal for a preference aggregation rule. As we did
for individuals, we will assume that the social choice is the outcome
that is maximal under the aggregate preference ordering. In social
choice settings, we refer to the set of maximal choices as the core.


DEFINITION 4.8. Given $X$, $p \in \mathcal{R}^n$ and a preference
aggregation rule $f$, the core is defined as
$C_{f(p)}(X) = M(f(p), X)$.


Applying theorem 1 of chapter 2, we know that if X is finite and 
the collective preference is complete and transitive then the core is non- 
empty, and the social choice is well defined. However our analysis of 
Arrow’s theorem indicates that transitivity of preference aggregation 
rules is not always satisfied. In such a case, we say that the core is 
empty or does not exist. 

However, given the results of the last section, we know that majority
rule is transitive under single-peaked preferences, so that a core does
exist. Since the social preference under majority rule and
single-peaked preferences is the preference ordering of the agent
with the median ideal point, it follows that the majority rule core
is $t_m$. When a majority rule core exists, its outcome is known as a
Condorcet winner, after the Marquis de Condorcet, who was among the
first to formally study the properties of voting procedures. However,
the result that the majority rule core is non-empty with single-peaked
preferences is generally attributed to Duncan Black.


THEOREM 4.3. If $n > 2$ is odd and $p \in S$ then, letting $f(\cdot)$
be majority rule,
$C_{f(p)}(X) = \{t_i : |\{j \in N \setminus \{i\} : t_j < t_i\}| = |\{k \in N \setminus \{i\} : t_k > t_i\}|\}$.
That is, the core is the median voter’s ideal point.
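The median voter result is easy to check numerically in one dimension. The sketch below is in Python and assumes distance-based (symmetric) single-peaked preferences on the real line, matching the inequality $|q(t_i) - q(x)| < |q(t_i) - q(y)|$ used earlier; the five ideal points and the grid of alternatives are hypothetical.

```python
import statistics


def majority_prefers(ideals, x, y):
    """True if a strict majority of voters is strictly closer to x than to y."""
    votes_for_x = sum(abs(t - x) < abs(t - y) for t in ideals)
    return votes_for_x > len(ideals) / 2


def condorcet_winners(ideals, alternatives):
    """Alternatives that defeat every other alternative in pairwise majority votes."""
    return [
        x
        for x in alternatives
        if all(majority_prefers(ideals, x, y) for y in alternatives if y != x)
    ]


ideals = [1, 2, 4, 7, 9]       # hypothetical ideal points, n odd
alternatives = range(0, 11)    # policies 0, 1, ..., 10

print(condorcet_winners(ideals, alternatives))
print(statistics.median(ideals))
```

With these inputs the only alternative that survives every pairwise vote is the ideal point of the median voter, as Theorem 4.3 states.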


The proof is essentially the same as that of the transitivity of
majority rule with single-peaked preferences. This result indicates that
if we are willing to assume that preferences satisfy the restrictive
assumption of single-peakedness, the majority rule core is well defined,
and submitting to the “will of the majority” may be a reasonable way
to make collective choices. However, the restriction may not be
appropriate for some settings. For example, suppose the set of policies
is two dimensional. Then, as we will see, the generalization of
single-peakedness is extraordinarily restrictive.

Consider Figure 4.3, which gives ideal points for five voters in two
dimensions. We will assume that each agent has circular indifference
curves so that it will vote for the alternative closest to its ideal
point. Our claim is that point 5 is a majority rule core point, or the
Condorcet winner, as a majority prefers it to any other point in the
policy space. To demonstrate this claim, we need to show that at least
three voters will block any other policy. First, consider a move to any
policy in the region marked W. Obviously, voter 5 will vote against any
such move, as will voters 1 and 3. Thus, 5’s ideal point is majority
preferred to any policy in region W. Similarly, voters 1, 2, and 5 will
vote against moves to region X, voters 2, 4, and 5 will vote against
region Y, and voters 3, 4, and 5 will vote against region Z.


Insert Figure 4.3 Here 


The reason voter 5’s ideal point is in the core is that voter 5 would
be the median voter over any two alternatives, in that if she prefers x
to y at least two other voters will as well. Since this must also be
true for comparisons of the ideal points of other players, voter 5’s
ideal point must lie on the lines connecting opposing pairs of ideal
points (2-3 and 1-4). Thus, this condition (which we formalize in the
next section) is fragile. A slight deviation from the intersection of
these lines as in Figure 4.4 destroys the majority core. First, note
that the intersection cannot be a core point because 3, 4, and 5 prefer
5’s ideal point to the intersection. Secondly, note that 5’s ideal
point cannot be a Condorcet winner, because there is a set of points Y′
that are preferred by 1, 2, and 3. Therefore, the existence of a
“median in all directions” is a very restrictive assumption.


Insert Figure 4.4 Here 


Given that the conditions on the existence of a majority rule core
are so restrictive, an obvious question to ask is whether majority rule
can at least reduce the set of possible outcomes by ruling some out as
undesirable. Recall that in our introductory example, the department
chair felt that it was reasonable to conclude that the department should
not hire in American or Formal Theory since both of those fields were
defeated by the three other fields. Thus, majority rule could eliminate
two options even though it created a cycle among the three top
alternatives. In this example, C, I, and T represented a top cycle: a
set of alternatives that defeat all alternatives outside the set, but
over which the aggregation rule is intransitive. Thus, perhaps while
majority rule does not produce a core, it can produce a small top
cycle. Unfortunately, this optimism is also unwarranted. Richard
McKelvey showed that under sincere voting (given any pair of
alternatives, the agent votes for the closer of the pair) and Euclidean
preferences, the top cycle is either the core or the entire set of
alternatives. While we treat this result more formally in the next
section, the main intuition can be seen in the example presented in
Figure 4.5. Here we have three voters with quadratic preferences.
Assume that there is an initial status quo policy a. To illustrate
McKelvey’s result, it is sufficient to demonstrate that pairwise
majority voting can lead from point a to anywhere in the policy space
and then return to a. First, note that point b is majority preferred
to a since voters 1 and 2 prefer it. Continuing, note that 2 and 3
prefer c to b, and 1 and 3 prefer d to c. Note that at each subsequent
stage of the agenda, the set of policies that are preferred to the
current status quo is getting larger. This allows us to reach points
further and further away from the voters’ ideal points. Finally, note
that 1 and 2 prefer the very distant point e to d. From e, we can
either return to a (the voters unanimously prefer a) or we can leverage
e to get to points further and further away.
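An agenda of this kind can be built mechanically. The Python sketch below is one possible construction, not the one drawn in Figure 4.5: at each stage the status quo is reflected across the line through two voters' ideal points and then pulled slightly back toward that line, which leaves both of those voters strictly closer to the new proposal. The ideal points, the starting policy, and the `shrink` factor are all hypothetical choices.

```python
import math


def dist(p, q):
    return math.hypot(p[0] - q[0], p[1] - q[1])


def majority_prefers(ideals, x, y):
    """True if a strict majority of voters is strictly closer to x than to y."""
    return sum(dist(t, x) < dist(t, y) for t in ideals) > len(ideals) / 2


def flip_across(point, a, b, shrink=0.9):
    """Reflect point across the line through a and b, then move it toward the line.

    The voters located at a and b both end up strictly closer to the result
    whenever point is not on the line itself.
    """
    dx, dy = b[0] - a[0], b[1] - a[1]
    length = math.hypot(dx, dy)
    nx, ny = -dy / length, dx / length                    # unit normal to the line
    s = (point[0] - a[0]) * nx + (point[1] - a[1]) * ny   # signed distance
    return (point[0] - (1 + shrink) * s * nx, point[1] - (1 + shrink) * s * ny)


ideals = [(0.0, 0.0), (4.0, 0.0), (0.0, 4.0)]  # hypothetical ideal points
pairs = [(0, 1), (1, 2), (2, 0)]               # coalition used at each stage
agenda = [(1.0, 1.0)]                          # initial status quo a
for step in range(5):
    i, j = pairs[step % 3]
    agenda.append(flip_across(agenda[-1], ideals[i], ideals[j]))

for policy in agenda:
    print(tuple(round(c, 3) for c in policy))
```

Each proposal defeats its predecessor by a two-to-one vote, the final proposal lies far outside the triangle of ideal points, and the starting policy would defeat it unanimously, closing the cycle.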


Insert Figure 4.5 Here 


As a positive methodology the study of preference aggregation rules 
does not offer clear predictions. This is best exemplified by the result 
from McKelvey showing that it is generally the case that any policy can 
beat any other policy through a finite agenda. Some have interpreted 
this result as a prediction of chaos, whereby the theory predicts that
politics should be chaotic with observable cycles. This interpretation
is naive, as it attributes a positive prediction to results that state that 
social choice does not generally offer predictions. A more reasonable 
interpretation is that the results demonstrate the need to investigate 
the political institutions within which collective choice is made. Un- 
der this interpretation the conclusion is that a model that takes as 
primitives only preferences and a preference aggregation rule may be 
underspecified. The tools of non-cooperative game theory will allow 
us to construct richer theories of collective choice. 


3.1. Formal Analysis of the Plott and McKelvey Results**. 
In this section, we present a much more formal analysis of Plott and 
McKelvey’s results about majority rule when preferences are multi- 
dimensional. All of the following analysis is based on the following 
social choice environment. 


CONDITION 1. We assume $X \subset \mathbb{R}^d$ ($d$ finite) is convex
and agents have strictly convex, continuous preferences on $X$.


If $X$ is compact then Theorems 3 and 4 of chapter 2 imply that each
agent has a unique ideal point $y_i$ in $X$. Instead of assuming that
$X$ is compact, we will assume directly that each agent has an ideal
point. The assumption that preferences are strictly convex requires
that the upper contour sets are strictly convex sets. The classic
special case of Euclidean preferences, $u_i(x) = -\|x - y_i\|$, is
convenient as in this case the upper contour sets are spherical. If we
assume that preferences are Euclidean, we can specify exactly how
utility changes as the policy alternatives are varied.


DEFINITION 4.9. If preferences are Euclidean then for any $x \in X$
the gradient vector is $\nabla u_i(x) = y_i - x$.


The gradient vector is a directed vector or line segment that points
away from the origin in the direction that agent $i$ would most like
policy to move from point $x$.

The statement of Plott’s result also uses the notion of a pairing.
For a finite set $A$ we will call a mapping $p : A \to A$ a pairing if
it is one-to-one. This means that each $i$ in $A$ is paired with
exactly one $j$ in $A$. Now we can state Plott’s conditions.


DEFINITION 4.10. In the spatial model with Euclidean preferences
the Plott conditions are satisfied at a policy $x \in X$ if there
exists a pairing $p(\cdot)$ on the set $L = \{j \in N : y_j \neq x\}$
such that for every $i \in L$,
$\nabla u_i(x) = -\lambda_i \nabla u_{p(i)}(x)$ for some
$\lambda_i > 0$.


The intuition is that when the Plott conditions are satisfied at $x$,
the set of agents $L$ that do not have $x$ as their ideal point can be
paired so that each agent that wants to move in a particular direction
is offset by a particular agent that wants to move in exactly the
opposite direction. Since proponents of any change are paired with
opponents, it is impossible to build a majority coalition to overturn
$x$. The following result characterizes the relationship between the
Plott conditions in the spatial model with Euclidean preferences and
the majority rule core.


THEOREM 4.4. In the spatial model with Euclidean preferences and
$n$ odd, the point $x$ in the interior of $X$ is in the core
$C_{f(p)}(X)$ if and only if the Plott conditions are satisfied at
$x$.⁴
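The Plott conditions can be checked directly for a given list of ideal points. The Python sketch below assumes Euclidean preferences, so that agent $i$'s gradient at $x$ is $y_i - x$, and greedily pairs each agent with any unmatched agent whose gradient points in exactly the opposite direction. The coordinates are hypothetical stand-ins for the five-voter configurations discussed earlier, and `tol` is an assumed numerical tolerance.

```python
import math


def opposite(u, v, tol=1e-9):
    """True if u = -lambda * v for some lambda > 0 (both vectors nonzero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norms = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return abs(dot + norms) <= tol * norms   # dot equals -|u||v|


def plott_conditions_hold(ideals, x, tol=1e-9):
    """Pair off agents whose gradients y_i - x point in opposite directions."""
    gradients = [tuple(yi - xi for yi, xi in zip(y, x)) for y in ideals]
    # Agents whose ideal point is x itself are left out of the pairing.
    unmatched = [g for g in gradients if any(abs(c) > tol for c in g)]
    while unmatched:
        g = unmatched.pop()
        partner = next((h for h in unmatched if opposite(g, h, tol)), None)
        if partner is None:
            return False
        unmatched.remove(partner)
    return True


# Hypothetical layout: voter 5 sits where the lines 1-4 and 2-3 cross.
symmetric = [(-1.0, 1.0), (1.0, 1.0), (-1.0, -1.0), (2.0, -2.0), (0.0, 0.0)]
# The same layout with voter 5 nudged off the intersection.
perturbed = symmetric[:4] + [(0.1, 0.3)]

print(plott_conditions_hold(symmetric, (0.0, 0.0)))
print(plott_conditions_hold(perturbed, (0.0, 0.0)))
print(plott_conditions_hold(perturbed, (0.1, 0.3)))
```

Greedy matching is enough here because "points in the opposite direction" groups agents into classes: any unmatched agent in the opposing class is as good a partner as any other.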


Even in the restrictive case of Euclidean preferences, it is clear that
the Plott conditions will not in general be satisfied. Suppose that
the conditions are satisfied for some $x$. This implies that for all
$i \in L$, $y_i - x = -\lambda_i (y_{p(i)} - x)$. If we perturb
$y_{p(i)}$ so that it lies on a different vector from the origin, this
condition will no longer hold at $x$. More precisely, if we think of the
space $\mathbb{R}^{dn}$ as the space of possible ideal points of $n$
agents with Euclidean preferences on the choice space $\mathbb{R}^d$,
then the subset of $\mathbb{R}^{dn}$ for which the Plott conditions are
satisfied at some $x \in \mathbb{R}^d$ is incredibly small.
Specifically, it contains no open sets and thus has an empty interior.
Informally stated, if one imagined randomly picking an arbitrary
profile from this space, the probability of selecting one that
satisfies the Plott conditions for some point would be 0.

While the set of profiles with a core point is very small, for any
such profile there is another profile that is arbitrarily close and also
has a core point. In the exercises, we ask the reader to show that if
one considers small perturbations that also yield a core point, then the
core point is only perturbed a little.⁵

⁴ A policy $x$ is in the interior of $X$ if there is an open ball
$B(x, \varepsilon)$ that is contained in $X$.

⁵ The assumption that preferences are Euclidean can be replaced by a
differentiability condition to produce a more general result.

We may conjecture that even though the core is generally empty,
there is some other subset of the policy space which possesses
normatively desirable properties and is therefore a reasonable
prediction. One such concept is the following.


DEFINITION 4.11. For a set $X$, a profile $p \in \mathcal{R}^n$ and a
preference aggregation rule $f$, the top cycle set $T_{f(p)}$ is the set

$T_{f(p)} = \{x \in X : \forall y \in X \setminus \{x\},\ \exists J \in \mathbb{N} \text{ and } \{a_1, \dots, a_J\} \subset X \text{ such that } a_1 = x,\ a_J = y, \text{ and } \forall\, 2 \le j \le J,\ a_{j-1} P a_j\}$

The top cycle set is the set of points that can be reached from
any other point via a finite chain of strict preferences. That is, if
$x \in T_{f(p)}$ then for every $y \in X \setminus \{x\}$ we can select
a finite number of policies $\{a_1, a_2, \dots, a_j\}$ for which
$x P a_1 P a_2 P \cdots P a_j P y$. The following result indicates that
either the Plott conditions are satisfied or the top cycle set covers
the policy space.


THEOREM 4.5. In the spatial model either $C_{f(p)}(X)$ is non-empty
or $T_{f(p)} = X$.


The implications of the last two theorems are striking. In the
spatial model with Euclidean preferences, unless a knife-edged condition
holds (the Plott conditions), any policy can be reached from any other
policy in a finite chain of strict preferences.


4. Manipulation of Choice Functions


As the previous section illustrated, majority rule very often fails
to provide sufficient guidance for making social choices. However,
even in conditions in which a majority core exists, agents may not
have the incentive to reveal their preferences truthfully so that the
choice function can be implemented. As Gibbard (1973) and Satterthwaite
(1975) have shown, all social choice functions including majority rule
are susceptible to manipulation by agents acting strategically. We have
already seen one such example in the attempt of the formal theorists to
manipulate the Borda count by misrepresenting their preferences over
the fields. Such manipulation is also possible in voting. Consider
a voting agenda where x is first paired against y and the winner is
then paired against z. Assume that by majority vote, $x P y$, $z P x$,
and $y P z$. Thus, if all voters voted according to their actual
preferences, x would defeat y and then lose to z. However, voters who
preferred x to y and y to z would have an incentive to vote
strategically for y (misrepresent their preferences between x and y) in
the first round so that y might win round 1 and go on to defeat z.
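This agenda can be simulated in a few lines. The Python sketch below assumes a hypothetical three-voter profile that generates the majority relations $x P y$, $z P x$, and $y P z$; each ballot is a ranking written best-first, and a voter's reported ranking may differ between rounds.

```python
def pairwise_winner(ballots, a, b):
    """Winner of a majority vote between a and b given reported rankings."""
    votes_for_a = sum(r.index(a) < r.index(b) for r in ballots)
    return a if votes_for_a > len(ballots) / 2 else b


def run_agenda(round1_ballots, round2_ballots):
    """x faces y in round 1; the winner faces z in round 2."""
    first = pairwise_winner(round1_ballots, "x", "y")
    return pairwise_winner(round2_ballots, first, "z")


# Hypothetical true preferences, best-first.
true_prefs = [("x", "y", "z"), ("y", "z", "x"), ("z", "x", "y")]

sincere_outcome = run_agenda(true_prefs, true_prefs)

# Voter 0 (true ranking x > y > z) votes for y in round 1 only.
round1_with_misreport = [("y", "x", "z")] + true_prefs[1:]
strategic_outcome = run_agenda(round1_with_misreport, true_prefs)

print(sincere_outcome, strategic_outcome)
```

Under sincere voting the outcome is z, the first voter's least preferred alternative; by voting for y in round 1 she obtains y instead, which she prefers to z.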

To formalize the Gibbard-Satterthwaite theorem, we need some
additional notation and definitions.


DEFINITION 4.12. The social decision function is an onto function
$G : \mathcal{R}^n \to X$ that generates an outcome given a preference
profile.


Thus, a social decision function takes a preference profile and
produces an outcome (as opposed to a preference ordering). The
requirement that $G$ be “onto” means that every profile produces an
outcome and that every outcome can be supported by some profile. Now we
can define manipulation.


DEFINITION 4.13. $G$ is manipulable at $p$ if and only if for some
$i$ there exists $p' = (R_1, \dots, R_{i-1}, R'_i, R_{i+1}, \dots, R_n)$
such that $G(p') P_i G(p)$. $G$ is non-manipulable if it is not
manipulable at any $p$.


Thus, we say that a social decision function is manipulable if a single
agent finds that it can change the outcome to one it prefers by reporting
something other than her true preferences. In the definition, agent $i$
changes the outcome from $G(p)$ to $G(p')$ (an outcome she prefers) by
claiming preferences $R'_i$ rather than $R_i$. We can now state Gibbard
and Satterthwaite’s result.


THEOREM 4.6. If there are at least three alternatives and $G$ is
non-manipulable, then there is a dictator (i.e. there is some $i$ such
that $G(p) P_i x$ for all $x \neq G(p)$ and for all
$p \in \mathcal{R}^n$).
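For small cases, manipulability in the sense of Definition 4.13 can be found by exhaustive search. The Python sketch below restricts attention to strict rankings of three alternatives and tries every unilateral misreport at every profile. The two social decision functions, a Borda winner with alphabetical tie-breaking and a dictatorship of the first agent, are hypothetical examples chosen for illustration.

```python
from itertools import permutations, product

ALTS = ("a", "b", "c")
RANKINGS = list(permutations(ALTS))  # all strict rankings, best-first


def borda_winner(profile):
    """Lowest Borda score wins; ties are broken alphabetically (an assumption)."""
    scores = {alt: sum(r.index(alt) for r in profile) for alt in ALTS}
    return min(ALTS, key=lambda alt: (scores[alt], alt))


def dictatorship(profile):
    """Agent 0's top-ranked alternative is always chosen."""
    return profile[0][0]


def find_manipulation(G, n):
    """Return (profile, agent, misreport) if some agent gains by lying, else None."""
    for profile in product(RANKINGS, repeat=n):
        honest = G(profile)
        for i in range(n):
            for lie in RANKINGS:
                outcome = G(profile[:i] + (lie,) + profile[i + 1:])
                # Agent i gains if the new outcome ranks higher in her true ranking.
                if profile[i].index(outcome) < profile[i].index(honest):
                    return profile, i, lie
    return None


print(find_manipulation(borda_winner, 3))
print(find_manipulation(dictatorship, 3))
```

The search is consistent with the theorem: the onto, non-dictatorial Borda rule admits a profitable misreport at some profile, while the dictatorship admits none.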


The outline of the proof is as follows. First assume that $G$ is
non-manipulable. Then we can construct a transitive and IIA preference
ordering by applying $G$ to all of $X$ to get the most preferred outcome
and then applying $G$ to the remaining elements of $X$ to get the second
outcome, and so on. We can then show that this ordering satisfies
the weak Pareto criterion. Thus, by Arrow’s theorem, we must have
a dictator.

Now consider the details. Suppose that $G$ is non-manipulable. For
step 1, let $p$ and some $B \subset X$ be such that for all $i$, all
$x \in B$, and all $y \in X \setminus B$, $x P_i y$. Thus, $B$ is a set
of alternatives that all agents prefer to all alternatives not in the
set. Our first claim is that the social decision should be from this
“best” set, or that $G(p) \in B$. Suppose that this were not true.
Since $G$ is onto, we can pick an alternative profile $p'$ such that
$G(p') \in B$. Then we can construct a series of alternatives:


$y_0 = G(p)$
$y_i = G(p \mid R'_1, \dots, R'_i)$
$y_n = G(p')$


where $y_i$ is the social decision when the first $i$ agents report
their preferences from $p'$ and the remaining agents report their
preferences from $p$. Now let $k$ be the smallest integer such that
$y_k \in B$. Since agent $k$ prefers everything inside $B$ to everything
outside, she can get a better alternative by reporting $R'_k$ instead of
$R_k$. This contradicts $G$ being non-manipulable. Thus, it must be true
that $G(p) \in B$.

The next step is to create an aggregation rule. Let the highest ranked
element be $x_1 = G(p)$. Then move $x_1$ to the bottom of everyone’s
preference ranking to create $p_1$. Then let the second ranked choice be
$x_2 = G(p_1)$. From step 1, we know that $x_2 \neq x_1$. We can
continue this process until we have ranked all of the alternatives.

It is rather easy to see that the preference ordering will satisfy
the weak Pareto criterion, since at every stage the decision rule has to
choose an element of the “best” set for the constructed profile. So now
we need to show that our aggregation rule is IIA. Suppose that it were
not. Then there must be two profiles $p$ and $p'$ and alternatives
$x, y \in X$ such that $x R_i y$ if and only if $x R'_i y$ for all $i$
but that $x\, f(p)\, y$ and $y\, f(p')\, x$. Let $p(x,y)$ be the profile
that agrees with $p$ everywhere except that $x$ and $y$ are moved to the
top of everyone’s ordering. We claim that $G(p(x,y)) = x$. Suppose it
were not. Then let $\bar{p}$ be the profile created by dropping
alternatives to the bottom until $G(\bar{p}) = x$. Then consider a
sequence $y_i = G(p(x,y) \mid \bar{R}_1, \dots, \bar{R}_i)$ so that
$y_i$ is the social decision created by switching the first $i$ agents
to the new profile. Note that $y_n = G(\bar{p}) = x$ and
$y_0 = G(p(x,y)) = y$ since step 1 implies $G(p(x,y)) \in \{x, y\}$.
Similar to above, let $k$ be the smallest integer such that
$y_k \neq y$. If $y_k = x$, then if $x P_k y$, $G$ can be manipulated by
switching from $R_k(x,y)$ to $\bar{R}_k$. Alternatively, if $y P_k x$, a
switch from $\bar{R}_k$ to $R_k(x,y)$ manipulates $G$. If
$y_k \neq x$, then consider the smallest $j > k$ such that
$y_j \in \{x, y\}$. Using exactly the same logic as above, agent $j$ can
manipulate $G$. So we have contradicted the assumption that $G$ is
non-manipulable, which implies that $G(p(x,y)) = x$.

Now consider a sequence
$z_i = G(p(x,y) \mid R'_1(x,y), \dots, R'_i(x,y))$. The assumption of no
IIA implies that $z_n = G(p'(x,y)) = y$ and $z_0 = G(p(x,y)) = x$. Thus,
as above, there must be some agent who prefers $y$ to $x$ and can switch
their preference from $R(x,y)$ to $R'(x,y)$ and change the outcome from
$x$ to $y$, or an agent preferring $x$ to $y$ who can switch from
$R'(x,y)$ to $R(x,y)$. Thus, $G$ is manipulable. This contradiction
implies that the preference ordering must be IIA. Thus, by Arrow’s
theorem, there must be a dictator for $f$ and therefore a dictator for
$G$.

While essentially a negative result, this theorem does have impor- 
tant implications about the study of politics. Perhaps the most im- 
portant is that strategic behavior is likely to be ubiquitous in politics 
in that mechanisms that are “strategy proof” typically do not exist. 
Thus, the next chapter begins our study of strategic models of politics.


5. Exercises


EXERCISE 4.1. Suppose players have the following preferences:

(1) What are the Borda counts for each of the alternatives?

(2) How can player 1 do better by misrepresenting her preferences? 

(3) How can player 2 do better by misrepresenting his preferences? 

(4) Is there any combination of statements (not necessarily truth- 
ful) for which the two players would not have an incentive to 
change, ex post? 


EXERCISE 4.2. Suppose there are three voters who are to decide on 
an alternative via pairwise majority rule. If there are three alterna- 
tives, all preferences are strict, and each voter has a different preference 
ordering from the other two, what percentage of the possible combina- 
tions of preferences result in a Condorcet winner? (Note that if two 
individuals share a common preference ordering, their most preferred 
must be a Condorcet winner. Why?) 


EXERCISE 4.3. Assume that there are three voters with Euclidean
preferences in two dimensions with ideal points at (-1,0), (0,1), and
(1,0) respectively.
a. Construct an agenda to get from (0,0) to (2,2).
b. Construct an agenda to get from (0,0) to (5,5).
c. Construct an agenda to get from (0,0) to (-5,-5).
Try to keep these agendas as short as possible.


EXERCISE 4.4. Show that if $p \in \mathbb{R}^{dn}$ is a profile of ideal
points for which the Plott conditions are satisfied at some
$x \in \mathbb{R}^d$ then for every $\varepsilon > 0$ there exists a
profile $p^* \in B(p, \varepsilon)$ for which the Plott conditions
are not satisfied at any point for the profile $p^*$.


EXERCISE 4.5. Show that if $p \in \mathbb{R}^{dn}$ ($n$ odd) is a profile of 
ideal points for which the Plott conditions are satisfied at some 
$x \in \mathbb{R}^{d}$, then for every $\varepsilon > 0$ there exists a 
$\delta > 0$ such that if $p^{\delta} \in B(p, \delta)$ and the Plott 
conditions are satisfied for some point at the profile $p^{\delta}$, then the 
Plott conditions are satisfied for a point $x' \in B(x, \varepsilon)$ by the 
profile $p^{\delta}$. 


CHAPTER 5 


Games in the Normal Form 


About 12.5 minutes into the broadcast, two murder suspects are 
arrested by Detectives Logan and Briscoe. District attorney Adam 
Schiff instructs assistant DA Jack McCoy to make the following offer 
to each separately: 


• Confess and provide evidence of first degree murder by your 
accomplice. If she does not confess, you get a 1 year sentence 
on a weapons charge. If she does confess as well, you both get 
8 years for murder II. 

• Hold out. If your accomplice turns state’s evidence, you will 
serve 25 to life for murder I. If she holds out, you will get 4 
years for voluntary manslaughter. 


Assuming each suspect loses a unit of utility for each year in prison, 
the following table shows the payoffs of each suspect given all of the 
possible outcomes. The rows represent the choices of suspect 1 while 
the columns represent the actions of suspect 2. Each pair of numbers 
represents the payoffs for suspect 1 and suspect 2 respectively for each 
combination. 


Table 5.1: The Prisoner’s Dilemma 

1\2      | Hold Out | Confess 
Hold Out | -4,-4    | -25,-1 
Confess  | -1,-25   | -8,-8 


Note that the situation is strategic in the sense that the outcome 
of any action by suspect 1 depends on the choices of suspect 2 and 
vice versa. What should we expect the suspects to do? Collectively, 
they would like to hold out. If they both hold out, the total jail time 
would be only eight years, far less than any other outcome. However, 
unless they can reach some kind of binding agreement, it’s clear that 
the individual incentives of the suspects will preclude this outcome. 
Suppose that suspect 1 were to hold out. Then suspect 2 would recognize 
that she could do better by confessing, reducing her jail time from 4 
years to 1. Suspect 1 would have the same epiphany, making the 
“socially optimal” agreement impossible. In fact, each suspect will 
recognize that she does better by confessing regardless of the other’s 
action: confessing means 1 year rather than 4 if the other holds out, 
and 8 years rather than 25 if the other confesses. Thus, they both 
confess, leading to 16 total years of jail. Individual rationality 
therefore leads to a socially inferior outcome (where society refers to 
the suspects; the DA and the police presumably prefer this outcome).¹ 

In this game, the well-known “Prisoner’s Dilemma”, it is fairly 
straightforward to deduce what strategies rational actors will choose. 
However, in other strategic situations, the predictions are more 
subtle. Consider the following game, which we call the “Terrorist Hunt.” 
Suppose that there are two agencies, the FBI and the CIA, which are 
responsible for investigating and apprehending terrorist suspects. We 
assume that there are two types of suspects, kingpins and operatives. 
Both agencies prefer capturing kingpins to capturing operatives to cap- 
turing no suspects. However, to capture a kingpin, the two agencies 
must cooperate by dedicating resources to a joint effort. Thus, if one 
agency fails to cooperate in the investigation, the other agency will fail 
to capture any suspects. On the other hand, each agency can capture 
an operative simply by acting on its own. Given this setting, we can 
now illustrate the strategic situation that each agency faces in deciding 
whether to go after the kingpin or the operative. 


Table 5.2: The Terrorist Hunt 

FBI\CIA   | Kingpin | Operative 
Kingpin   | 2,2     | 0,1 
Operative | 1,0     | 1,1 


The rows of this matrix represent the possible strategies of the FBI 
(hunt kingpin or hunt operative) while the columns represent those of 
the CIA. For both agencies, we have assigned a utility of 2 for the 
kingpin, 1 for the operative, and 0 for failing to capture either. Let’s 
begin with the FBI’s decision. Unlike the Prisoner’s Dilemma, the 
FBI’s best choice depends on the CIA’s choice. If the CIA hunts the 
kingpin, the FBI gets 2 from cooperating on that search versus 1 for 
hunting an operative by itself. However, if the CIA strikes out on its 
own, the FBI gets 0 from hunting the kingpin while it could generate 
1 by pursuing an operative. Thus, the FBI’s choice depends on what 
it believes the CIA will do and vice versa. So what is it reasonable 


¹Not to leave the reader in limbo, here is a quick summary of the rest of the 
episode. The confessions are thrown out on a technicality by an Upper West Side 
judge. The episode ends with a pithy piece of wisdom by Schiff just as McCoy pours 
himself an 18 year old Scotch. 
for each to believe? A key development in the study of strategic 
interaction was John Nash’s characterization of rational behavior in 
such situations. In Nash’s formulation, each agency should choose 
a strategy that is a “best response” to the action of the other agency. 
If both agencies pursue such a course, the outcome is a best response 
to a best response. Since neither agency has an incentive to change its 
strategy, such an outcome is known as a Nash equilibrium. 

To see whether an outcome is a Nash equilibrium, it suffices to show 
that both agencies are doing their best given the actions of the other 
agency. So consider whether the outcome where both agencies 
hunt the kingpin is a Nash equilibrium. Given that the CIA is hunting 
the kingpin, the best choice of the FBI is to hunt the kingpin. Similarly, 
the CIA’s best response to the FBI’s choice of the kingpin is to hunt 
the kingpin itself. Thus, both agencies pursuing the kingpin is a Nash 
equilibrium. However, this is not the only Nash equilibrium of the 
game. If the CIA decides to hunt the operative, the best that the FBI 
can do is to also settle for the operative. Since the CIA also prefers 
to hunt the operative when the FBI does, both agencies tracking an 
operative is also a Nash equilibrium.² While Nash’s solution does not 
lead to a single prediction, it does rule out some outcomes. A situation 
where one agency hunts the kingpin while the other tracks an operative 
is not a Nash equilibrium since the agency hunting the kingpin would 
do better by switching to a search for an operative. Likewise, the 
agency hunting an operative would like to deviate from its strategy to 
search for the kingpin. 
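
The deviation check described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name pure_nash_equilibria and the dictionary layout (strategy pair mapped to a payoff pair) are our own choices, while the payoffs are those of Tables 5.1 and 5.2. 

```python
from itertools import product


def pure_nash_equilibria(payoffs):
    """Return all pure-strategy Nash equilibria of a two-player game.

    `payoffs` maps (row_strategy, col_strategy) -> (u1, u2).
    The name and data layout are illustrative assumptions.
    """
    rows = sorted({r for r, _ in payoffs})
    cols = sorted({c for _, c in payoffs})
    equilibria = []
    for r, c in product(rows, cols):
        u1, u2 = payoffs[(r, c)]
        # Player 1 must not gain from any unilateral change of row.
        row_ok = all(payoffs[(r2, c)][0] <= u1 for r2 in rows)
        # Player 2 must not gain from any unilateral change of column.
        col_ok = all(payoffs[(r, c2)][1] <= u2 for c2 in cols)
        if row_ok and col_ok:
            equilibria.append((r, c))
    return equilibria


prisoners_dilemma = {
    ("hold out", "hold out"): (-4, -4),
    ("hold out", "confess"): (-25, -1),
    ("confess", "hold out"): (-1, -25),
    ("confess", "confess"): (-8, -8),
}
terrorist_hunt = {
    ("kingpin", "kingpin"): (2, 2),
    ("kingpin", "operative"): (0, 1),
    ("operative", "kingpin"): (1, 0),
    ("operative", "operative"): (1, 1),
}

print(pure_nash_equilibria(prisoners_dilemma))
print(pure_nash_equilibria(terrorist_hunt))
```

The check finds the single profile (confess, confess) in the Prisoner’s Dilemma and the two matching profiles in the Terrorist Hunt. It considers pure strategies only, so it does not report equilibria in which agencies randomize. 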

In the remainder of this chapter, we formalize and extend the con- 
cepts and issues raised by these examples. 


1. The Normal Form 


The first issue that one encounters when using game theory to model 
political phenomena is the question of how to represent the strategic 
situation. We begin with the simplest representation of a strategic 
situation: the normal form representation with complete and perfect 
information. Such a representation is based on the following elements: 


(1) Agents: We let $N$ represent the set of agents. When we wish 
to refer to an arbitrary agent, we use the notation $i \in N$. We 
also use the symbol $-i$ (read “not agent $i$”) to refer to 
all agents other than $i$. 


²While we defer the discussion of such possibilities, there is also a third 
equilibrium where each agency pursues the kingpin with probability .5 and the 
operative with probability .5. 
(2) Pure Strategies: A pure strategy is an agent’s plan of action 
such as “confess” or “hunt an operative” in our motivating 
examples. In a game with a single interaction, as in our 
examples, a strategy is simply an action. However, in a game 
with multiple interactions, a strategy specifies the action 
to be taken at each stage of the game. In a normal form 
representation, we must specify the set of pure strategies for 
each player, which we denote $S_i$ for each $i \in N$. An arbitrary 
strategy by agent $i$ is given by $s_i \in S_i$. Given the strategy 
sets for each agent, we can generate the set of all possible 
strategy profiles $S$ by computing all possible combinations of 
strategies. Formally, $S = \times_{i \in N} S_i$. A profile is then a vector 
$s = (s_1, \ldots, s_i, \ldots, s_n) \in S$. By 
$S_{-i} = \times_{j \in N \setminus \{i\}} S_j$ we denote the 
space of strategies for every player except $i$. To economize on 
notation, we often represent $s$ as $(s_i, s_{-i})$. Below we discuss 
extensions of the definition of strategies which allow agents 
to randomize over pure strategies. 

(3) Payoffs: A normal form representation requires a specification 
of a von Neumann-Morgenstern utility function for each player 
over the set of strategy profiles, or $u_i(s) : S \to \mathbb{R}^1$. Sometimes 
the utility function for $i$ is denoted $u_i(s_i, s_{-i})$. As in chapter 
3, the functions $u_i(\cdot)$ are Bernoulli utility functions, and given 
any lottery over $S$ the agent calculates her expected utility 
under the lottery. 


An interpretation of a normal form game is that at period 1 each 
player chooses their strategy $s_i \in S_i$ and in period 2 the agents receive 
$u_i(s)$ where $s = (s_1, \ldots, s_n)$. We will see that games with more periods 
can actually be reinterpreted as very large normal form games. 

Accordingly a normal form game is completely defined by 
$(N, \{S_i, u_i(\cdot)\}_{i \in N})$. We sometimes use the shorthand $(N, S, u)$ to 
represent a game, where $u$ without a subscript represents the vector 
of utility functions $(u_1(\cdot), \ldots, u_n(\cdot))$. Some simple but quite interesting 
games involving two players can be represented as matrices. 

To cement ideas, we now show that our two motivating examples 
can be fully described using the normal form. First consider the 
Prisoner’s Dilemma. Clearly $N = \{\text{player 1}, \text{player 2}\}$ and 
$S_1 = S_2 = \{\text{hold out}, \text{confess}\}$. We can also write the payoff 
functions as 

\[
u_i(s_i, s_{-i}) =
\begin{cases}
-8 & \text{if } s_i = s_{-i} = \text{confess} \\
-4 & \text{if } s_i = s_{-i} = \text{hold out} \\
-1 & \text{if } s_i = \text{confess and } s_{-i} = \text{hold out} \\
-25 & \text{if } s_i = \text{hold out and } s_{-i} = \text{confess}
\end{cases}
\]


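Similarly, the Terrorist Hunt can be represented as $N = \{\text{CIA}, \text{FBI}\}$, 
$S_1 = S_2 = \{\text{hunt kingpin}, \text{hunt operative}\}$ and 

\[
u_i(s_i, s_{-i}) =
\begin{cases}
2 & \text{if } s_i = s_{-i} = \text{hunt kingpin} \\
1 & \text{if } s_i = s_{-i} = \text{hunt operative} \\
1 & \text{if } s_i = \text{hunt operative and } s_{-i} = \text{hunt kingpin} \\
0 & \text{if } s_i = \text{hunt kingpin and } s_{-i} = \text{hunt operative}
\end{cases}
\]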


Note that we were able to represent both of these normal forms with 
matrices. For two agent games, the relationship between the normal 
form and a game matrix generalizes to: 


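Table 5.3: Generic Normal Form Game 

1\2    | s_{21}            | s_{22}            | ... | s_{2k} 
s_{11} | u(s_{11}, s_{21}) | u(s_{11}, s_{22}) | ... | u(s_{11}, s_{2k}) 
s_{12} | u(s_{12}, s_{21}) | u(s_{12}, s_{22}) | ... | u(s_{12}, s_{2k}) 
...    | ...               | ...               | ... | ... 
s_{1j} | u(s_{1j}, s_{21}) | u(s_{1j}, s_{22}) | ... | u(s_{1j}, s_{2k}) 

where $N = \{1, 2\}$, $S_1 = \{s_{11}, \ldots, s_{1j}\}$, and $S_2 = \{s_{21}, \ldots, s_{2k}\}$. 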

Representing a normal form with more than two players in a matrix 
is harder, since the strategy combinations cannot easily be laid out 
in two dimensions. However, sometimes it is useful to represent 
three player games using the following trick. Suppose we extended 
the Terrorist Hunt to include a third agency, the National Security 
Agency (NSA), whose strategy set is the same as the other two: 
$S_3 = \{\text{hunt kingpin}, \text{hunt operative}\}$. We assume that capturing the 
kingpin requires cooperation by at least 2 agencies, but that each can 
capture an operative on its own, so that the payoff function for the FBI 
is now: 

\[
u_1(s_1, s_{-1}) =
\begin{cases}
2 & \text{if } s_1 = \text{kingpin and } (s_2 = \text{kingpin or } s_3 = \text{kingpin}) \\
1 & \text{if } s_1 = \text{operative} \\
0 & \text{if } s_1 = \text{kingpin and } s_2 = \text{operative and } s_3 = \text{operative}
\end{cases}
\]


Now consider the following pair of matrices: 
Table 5.4: Game if NSA Hunts Kingpin 

FBI\CIA   | Kingpin | Operative 
Kingpin   | 2,2,2   | 2,1,2 
Operative | 1,2,2   | 1,1,0 


Table 5.5: Game if NSA Hunts Operative 

FBI\CIA   | Kingpin | Operative 
Kingpin   | 2,2,1   | 0,1,1 
Operative | 1,0,1   | 1,1,1 


The top matrix shows the payoff triples corresponding to the strategy 
combinations where the NSA hunts the kingpin while the lower matrix 
shows those for which the NSA hunts the operative. In general, three 
player normal form games can be represented by matrices of payoff 
triples corresponding to each possible strategy for player 3. 
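
As a rough sketch, the pair of matrices can be generated from the payoff rule rather than written out by hand. The code below assumes that all three agencies share the FBI’s payoff rule (the text states it only for the FBI), and the function names agency_payoff, payoff_triple and matrix_for are our own. 

```python
from itertools import product

STRATEGIES = ("kingpin", "operative")


def agency_payoff(own, others):
    """Payoff to one agency given its own choice and the other two choices.

    Assumes every agency has the same payoff rule as the FBI.
    """
    if own == "operative":
        return 1  # an operative can always be captured alone
    # Hunting the kingpin succeeds only with at least one partner.
    return 2 if "kingpin" in others else 0


def payoff_triple(fbi, cia, nsa):
    """Payoffs ordered as (FBI, CIA, NSA)."""
    return (
        agency_payoff(fbi, (cia, nsa)),
        agency_payoff(cia, (fbi, nsa)),
        agency_payoff(nsa, (fbi, cia)),
    )


def matrix_for(nsa):
    """One FBI-by-CIA matrix of payoff triples for a fixed NSA strategy."""
    return {
        (fbi, cia): payoff_triple(fbi, cia, nsa)
        for fbi, cia in product(STRATEGIES, repeat=2)
    }


for nsa in STRATEGIES:
    print(f"NSA hunts {nsa}")
    for (fbi, cia), triple in matrix_for(nsa).items():
        print(f"  FBI={fbi:9s} CIA={cia:9s} -> {triple}")
```

Fixing player 3’s strategy and tabulating the remaining two players is exactly the slicing trick described above; a game with more strategies for player 3 would simply produce more matrices. 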


2. Solutions to Normal Form Games 


The goal of game theory is to predict which element of $S$ will be 
chosen by the agents. In the Prisoner’s Dilemma we argued for the 
plausibility of {confess, confess} and that either {kingpin, kingpin} or 
{operative, operative} would be the result of the Terrorist Hunt. Now 
we lay out the general principles that lie behind these predictions. 


2.1. Elimination of Dominated Strategies. A reasonable first 
principle for rational behavior in games is that an agent should not play 
a strategy for which there exists an alternative strategy that raises 
her payoff for all possible strategies by her opponents. To make this 
criterion more concrete, recall the Prisoner’s Dilemma of Table 5.1. The 
essence of the solution discussed in the introduction is that player 1 
will never play “hold out” since it provides strictly less utility than 
“confess” for both possible choices by player 2. Thus, we say that “hold out” 
is strictly dominated for player 1 by “confess” and predict that she will 
not play it. Similarly, “hold out” is strictly dominated for player 2 
as well. Thus, the only strategy combination that does not contain 
strictly dominated strategies is {confess, confess}. 


DEFINITION 5.1. (Strict dominance in pure strategies) A strategy 
$s_i$ is strictly dominated by $s_i'$ for player $i$ iff 
$u_i(s_i, s_{-i}) < u_i(s_i', s_{-i})$ for all $s_{-i} \in S_{-i}$. 


DEFINITION 5.2. (Elimination by Strict Dominance in Pure Strategies) 
A strategy profile $s = (s_i, s_{-i})$ is consistent with elimination by 
strict dominance if $s_i$ is not strictly dominated for any $i \in N$. 
It is also possible to tighten the predictions of elimination by strict 
dominance by noting that it can be iterated, as in the following example. 
Suppose that we have a normal form game represented by the following 
matrix: 


Table 5.6 

1\2  | Left | Middle | Right 
Up   | 1,0  | 1,2    | 0,1 
Down | 0,3  | 0,1    | 2,0 


In this game agent 1 has no strictly dominated strategies. However, 
for agent 2, Right is dominated by Middle since Middle generates 2 
versus 1 against Up and 1 versus 0 against Down. If agent 1 recognizes 
that agent 2 will not choose Right, he will perceive the game as 


Table 5.7 

1\2  | Left | Middle 
Up   | 1,0  | 1,2 
Down | 0,3  | 0,1 


In this reduced form, Down is now dominated for agent 1 by Up 
(payoff of 1 versus 0 for any strategy by agent 2). Since agent 2 
knows that agent 1 will play Up, she prefers Middle (payoff of 2 
versus 0). Thus, {Up, Middle} is the solution consistent with iterated 
elimination of strictly dominated strategies. 


DEFINITION 5.3. (Iterated elimination of strictly dominated strategies) 
Given a normal form game $\Gamma^0 = (N, S^0, u^0)$, the process of 
iteratively deleting strictly dominated strategies is attained through the 
following algorithm: for $t = 1, 2, \ldots$ 

In period $t$ arbitrarily select a player $i^t \in N \setminus \{i^{t-1}\}$ and 
remove from $S_{i^t}^{t-1}$ each strategy that is strictly dominated in the 
game $\Gamma^{t-1}$. Call the set of strategies that survive $S_{i^t}^{t}$. Let 
$S_j^{t} = S_j^{t-1}$ for $j \in N \setminus \{i^t\}$ and let $u_i^{t}$ 
be the restriction of $u_i$ to $S^{t}$ for each $i \in N$. 

If at $\tau$ there is no $i \in N$ having a strictly dominated strategy in 
the game $\Gamma^{\tau-1}$, then call the set $S^{\tau-1}$ the set of outcomes 
that survive iterative deletion of strictly dominated strategies. 


It can be shown that regardless of what sequence of players is cho- 
sen the same set of actions will be reached. A justification for this 
procedure is to consider agents who reason in the following manner. 
I know that my opponents will not use strictly dominated 
strategies, and I know that my opponents know 
that I will not use strictly dominated strategies. Given 
this we are all really choosing from the smaller strategy 
space that survives the first n iterations. But I know that 
my opponents will not use a strategy that is strictly 
dominated in this game, and I know that my opponents know 
that I won’t play a strategy that is strictly dominated in 
this new game ... ad infinitum. 
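
A minimal sketch of the iterated procedure for two-player games follows. The function names and the dictionary layout are our own assumptions, and the payoffs are those of the three-column example as described in the text (the column name Left is assumed; only Middle and Right are named there). The sketch cycles through the players in a fixed order rather than selecting them arbitrarily, which does not matter given that the order of deletion does not affect the surviving set. 

```python
def strictly_dominated(game, player, strategy, strategies):
    """True if `strategy` is strictly dominated for `player` (0 or 1)
    by some other surviving pure strategy."""
    own, opp = strategies[player], strategies[1 - player]

    def u(s, t):
        # Build the (row, column) profile from this player's point of view.
        profile = (s, t) if player == 0 else (t, s)
        return game[profile][player]

    return any(
        all(u(alt, t) > u(strategy, t) for t in opp)
        for alt in own
        if alt != strategy
    )


def iterated_elimination(game, rows, cols):
    """Delete strictly dominated strategies until none remain."""
    strategies = [list(rows), list(cols)]
    changed = True
    while changed:
        changed = False
        for player in (0, 1):
            survivors = [
                s for s in strategies[player]
                if not strictly_dominated(game, player, s, strategies)
            ]
            if len(survivors) < len(strategies[player]):
                strategies[player] = survivors
                changed = True
    return strategies


# Payoffs (agent 1, agent 2) for the example game; "Left" is an assumed label.
example_game = {
    ("Up", "Left"): (1, 0), ("Up", "Middle"): (1, 2), ("Up", "Right"): (0, 1),
    ("Down", "Left"): (0, 3), ("Down", "Middle"): (0, 1), ("Down", "Right"): (2, 0),
}

print(iterated_elimination(example_game, ["Up", "Down"], ["Left", "Middle", "Right"]))
```

On this game the procedure removes Right, then Down, then Left, leaving Up for agent 1 and Middle for agent 2, which matches the step-by-step reasoning in the example. 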


While based on a much stronger premise, we can also use the idea 
of weak dominance to generate predictions from games. A strategy 
is weakly dominated if there is some other strategy that produces at 
least as high a payoff against all of the opponents’ strategy profiles and 
a strictly higher payoff against at least one profile. 


DEFINITION 5.4. (Weak dominance in pure strategies) A strategy 
$s_i$ is weakly dominated by $s_i'$ for player $i$ iff 
$u_i(s_i, s_{-i}) \le u_i(s_i', s_{-i})$ for all $s_{-i} \in S_{-i}$ and 
$u_i(s_i, s_{-i}) < u_i(s_i', s_{-i})$ for at least one $s_{-i} \in S_{-i}$. 


The definition of elimination by weak dominance is analogous to 
that of elimination by strict dominance. An important application of 
elimination by weak dominance in political science is in majority rule 
voting games. Assume that a set of n (odd) agents is voting between 
two candidates D and R. Each agent gets a payoff of 1 if her preferred 
candidate wins and 0 otherwise. We will also define the strategy sets 
so that $s_i = 1$ is a vote for D and $s_i = 0$ is a vote for R. Since the 
choice is by majority rule, the payoff for an agent who prefers D is 

\[
u_D(s) =
\begin{cases}
1 & \text{if } \sum_{j \in N} s_j \ge \frac{n+1}{2} \\
0 & \text{otherwise}
\end{cases}
\]

and the payoff for an agent who prefers R is $1 - u_D$. With this setup 
it is easy to see that no strategies are strictly dominated. Unless 
exactly $\frac{n-1}{2}$ of the other agents choose $s_j = 1$ and exactly 
$\frac{n-1}{2}$ choose $s_j = 0$, an agent’s utility is not affected by her 
individual choice. Thus, under most strategy profiles agents do not have 
strict preferences. However, since agents do have a strict preference at 
opponents’ profiles generating ties, voting for the preferred candidate 
weakly dominates voting for the lesser candidate. Thus, if we eliminate 
weakly dominated strategies, each agent votes for her preferred candidate 
and the candidate preferred by a majority wins. 
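
The weak dominance claim can be checked by brute force for a small electorate. The sketch below is illustrative only: the function names are our own, and the electorate size of three is an arbitrary choice. 

```python
from itertools import product


def d_wins(votes):
    """Majority rule with an odd number of voters: D wins iff it has most votes."""
    return sum(votes) > len(votes) / 2


def payoff(prefers_d, own_vote, other_votes):
    """1 if the agent's preferred candidate wins, 0 otherwise."""
    outcome_d = d_wins((own_vote, *other_votes))
    return 1 if outcome_d == prefers_d else 0


def classify_dominance(prefers_d, n):
    """Compare the sincere vote with the insincere vote for one agent.

    Returns "strict", "weak" or "none" according to how the sincere vote
    dominates the insincere one across all profiles of the other n - 1 votes.
    """
    sincere = 1 if prefers_d else 0
    insincere = 1 - sincere
    diffs = [
        payoff(prefers_d, sincere, others) - payoff(prefers_d, insincere, others)
        for others in product((0, 1), repeat=n - 1)
    ]
    if all(d > 0 for d in diffs):
        return "strict"
    if all(d >= 0 for d in diffs) and any(d > 0 for d in diffs):
        return "weak"
    return "none"


print(classify_dominance(True, 3))
print(classify_dominance(False, 3))
```

For both types of voter the sincere vote does at least as well against every profile of the others and strictly better only when the others split evenly, so the dominance is weak rather than strict. 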

As attractive as solutions based on dominance are, they suffer from 
a number of problems. Perhaps the most important is that they have 
little bite in a number of games. Neither version of the Terrorist Hunt 
contains dominated strategies. Thus, all strategy profiles are plausible 
if we only impose a dominance criterion. Secondly, iterated dominance 
imposes strong rationality requirements on the agents. The solution 
to the game described in Table 5.6 requires that agent 2 know for certain 
that agent 1 will not play Down because he is certain that agent 2 will not 
play Right. Any slip in this logical chain leads to other outcomes. On 
these grounds, elimination by weak dominance is especially vulnerable 
since the agents may only have a strict preference against a very small 
set of profiles and be indifferent against the rest. Thus, the predictions 
of weak dominance arguments may rest on agents basing decisions 
on very low probability occurrences. 


2.2. Nash Equilibrium. As we discussed in the introduction, 
John Nash made a fundamental contribution to game theory by 
developing a solution for normal form games that can be applied to a 
very large class of models. Nash’s solution requires that for all $i \in N$ 
agent $i$’s strategy $s_i$ be a best response to the strategies played by 
the other players $s_{-i}$. 

One of the most important concepts in game theory is the best 
response correspondence.³ The best response to an opponent’s profile 
$s_{-i}$ is simply the set of strategies that maximize an agent’s utility when 
played against $s_{-i}$. 


DEFINITION 5.5. The best response correspondence for agent $i \in N$ 
is a mapping $b_i(s_{-i}) : S_{-i} \rightrightarrows S_i$ defined as 

$b_i(s_{-i}) = \{s_i \in S_i : u_i(s_i, s_{-i}) \ge u_i(s_i', s_{-i}) \text{ for every } s_i' \in S_i\}$ 
for every $s_{-i} \in S_{-i}$. 


To make these abstract definitions concrete, consider the best response 
correspondences for our examples. In the Prisoner’s Dilemma, the best 
responses are: 


$b_1(\text{confess}) = \{\text{confess}\}$ 
$b_1(\text{hold out}) = \{\text{confess}\}$ 
$b_2(\text{confess}) = \{\text{confess}\}$ 
$b_2(\text{hold out}) = \{\text{confess}\}$ 


³For a discussion of correspondences, see the Mathematical Appendix. 
Similarly, the best response correspondences for the two agency version 
of Terrorist Hunt are 


$b_1(\text{kingpin}) = \{\text{kingpin}\}$ 
$b_1(\text{operative}) = \{\text{operative}\}$ 
$b_2(\text{kingpin}) = \{\text{kingpin}\}$ 
$b_2(\text{operative}) = \{\text{operative}\}$ 

In these examples, the best response is a point, but in many cases it 
will be a set of strategies. Recall the majority voting game of the 
last section. Unless the opposing profile generates an exact tie, voting 
for either candidate is a best response. Thus, formally the best response 
correspondence for agent $i$ preferring D is 


$b_i(s_{-i} : \sum_{j \ne i} s_j = \frac{n-1}{2}) = \{1\}$ 
$b_i(s_{-i} : \sum_{j \ne i} s_j \ne \frac{n-1}{2}) = \{0, 1\}$ 


Given the definition of the best response correspondence, we can define 
a Nash equilibrium as a strategy profile in which every agent is playing 
an element of her best response set against the strategies of the other 
agents. 


DEFINITION 5.6. A Nash equilibrium (in pure strategies) to a normal 
form game is a strategy profile $s^*$ satisfying the condition: for 
every $i \in N$ 

$s_i^* \in b_i(s_{-i}^*)$ 


We can also state the definition without reference to the best re- 
sponse correspondence. 


DEFINITION 5.7. A Nash equilibrium (in pure strategies) to a normal 
form game is a strategy profile $s^*$ satisfying the condition: for 
every $i \in N$ 


$u_i(s_i^*, s_{-i}^*) \ge u_i(s_i', s_{-i}^*)$ for every $s_i' \in S_i$. 


The concept of a Nash equilibrium (NE) is deceptively simple. We 
require that agents correctly conjecture what the other players will do 
and that they play a best response to this conjecture. An alternative 
interpretation based on the second definition is that at a strategy profile 
which is a NE no player has an incentive to unilaterally change her 
strategy. 

Now we can apply these definitions to our examples. 
(1) The Prisoner’s Dilemma: Since {confess} is the sole element 
of the best response set for both agents against all outcomes, 
the unique Nash equilibrium is {confess, confess}. 

(2) The Two Agency Terrorist Hunt: Since $b_i(\text{kingpin}) = \{\text{kingpin}\}$ 
for both agencies, {kingpin, kingpin} is a Nash equilibrium. 
Similarly, the fact that $b_i(\text{operative}) = \{\text{operative}\}$ implies 
that {operative, operative} is also a NE. 

(3) Three Agency Terrorist Hunt: First verify that the best 
response correspondence is 


$b_i(\text{kingpin}, \text{kingpin}) = \{\text{kingpin}\}$ 
$b_i(\text{operative}, \text{kingpin}) = \{\text{kingpin}\}$ 
$b_i(\text{kingpin}, \text{operative}) = \{\text{kingpin}\}$ 
$b_i(\text{operative}, \text{operative}) = \{\text{operative}\}$ 


Using the first definition, it is easy to see that 
$b_i(\text{kingpin}, \text{kingpin}) = \{\text{kingpin}\}$ implies that 
{kingpin, kingpin, kingpin} is a Nash equilibrium. 
$b_i(\text{operative}, \text{operative}) = \{\text{operative}\}$ implies 
that {operative, operative, operative} is a NE. But the facts 
that $b_i(\text{operative}, \text{kingpin}) = \{\text{kingpin}\}$ and 
$b_i(\text{kingpin}, \text{kingpin}) = \{\text{kingpin}\}$ imply that there are no 
Nash equilibria where there are just two agencies pursuing the kingpin 
(the third agency would switch to the kingpin as well), even though 
cooperation of two agencies is sufficient to capture him. 

(4) Majority Voting Game: Here we claim that almost any strategy 
profile is a Nash equilibrium. First consider any profile 
such that $\sum_i s_i < \frac{n-1}{2}$ or $\sum_i s_i > \frac{n+1}{2}$. This 
implies that $b_i(s_{-i}) = \{0, 1\}$ for all $i$. Thus, each such profile is 
a NE. Now consider $\sum_i s_i \in \{\frac{n-1}{2}, \frac{n+1}{2}\}$. Suppose 
that $\sum_i s_i = \frac{n+1}{2}$. This profile is a Nash equilibrium if and 
only if all agents choosing $s_i = 1$ prefer D. Suppose that this were not 
true and agent $i$ chooses $s_i = 1$ but prefers R. However, since 
$\sum_{j \ne i} s_j = \frac{n-1}{2}$, $b_i(s_{-i}) = \{0\}$. Thus, such an $s$ 
is not a NE. Similarly, if $\sum_i s_i = \frac{n-1}{2}$, $s$ is a NE if and 
only if all agents choosing $s_i = 0$ prefer R. Thus, the set of NE 
includes every profile except those in which one candidate wins by a 
bare majority which includes a voter who prefers the losing candidate. 
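
The characterization in the majority voting example can be checked by enumeration for a small case. The sketch below is only illustrative: the three-voter electorate with two supporters of D and one of R is hypothetical, and the function name is_nash is our own. 

```python
from itertools import product


def is_nash(profile, prefers_d):
    """Check that no voter gains by unilaterally switching her vote.

    profile[i] is 1 for a vote for D and 0 for a vote for R;
    prefers_d[i] is True if voter i prefers D.
    """
    n = len(profile)

    def utility(i, votes):
        d_wins = sum(votes) > n / 2
        return 1 if d_wins == prefers_d[i] else 0

    for i in range(n):
        deviation = list(profile)
        deviation[i] = 1 - profile[i]
        if utility(i, deviation) > utility(i, profile):
            return False
    return True


# Hypothetical electorate: voters 1 and 2 prefer D, voter 3 prefers R.
prefers_d = (True, True, False)
equilibria = [p for p in product((0, 1), repeat=3) if is_nash(p, prefers_d)]
print(equilibria)
```

The equilibria found are the two unanimous profiles and the bare-majority profile in which only the two D supporters vote for D. Every other bare-majority profile contains a winning-side voter who prefers the loser and would switch. With only three voters the excluded profiles are a large share of the total; the share of profiles in which nobody is pivotal grows with the size of the electorate. 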


It is interesting to note the similarities and differences between the set 
of Nash equilibria and the predictions of iterated dominance arguments. 
In one case, the Prisoner’s Dilemma, the predictions are the same. In 
the Terrorist Hunt games, the set of Nash equilibria is smaller than the 
set of outcomes consistent with elimination of dominated strategies. 
However, in the case of majority rule voting, the set of Nash equilibria 
is smaller than the set of strategies surviving strict dominance but 
much larger than the unique prediction of the elimination of weakly 
dominated strategies. 

The following theorems establish the exact link between NE and 
profiles surviving iterated dominance. 


THEOREM 5.1. If a strategy profile $(s_1^*, \ldots, s_n^*)$ is a Nash equilibrium, 
then none of its strategies can be eliminated through iterated elimination 
of strictly dominated strategies. 


PROOF. Suppose the theorem is false. Let $s_i^*$ be the first of the 
strategies in the equilibrium profile to be eliminated. That it is 
eliminated requires that $u_i(s_i^*, s_{-i}) < u_i(s_i', s_{-i})$ for some $s_i'$ 
and for all $s_{-i}$ that have not been eliminated. Since by assumption 
$s_{-i}^*$ has not been eliminated, $u_i(s_i^*, s_{-i}^*) < u_i(s_i', s_{-i}^*)$. 
This violates the assumption that $(s_1^*, \ldots, s_n^*)$ is a Nash 
equilibrium. □ 


THEOREM 5.2. If only the strategy profile $(s_1^*, \ldots, s_n^*)$ survives 
iterated elimination of strictly dominated strategies then it is the unique 
pure strategy Nash equilibrium. 


PROOF. Uniqueness follows from the previous theorem, as any Nash 
equilibrium must survive iterated elimination. Now we show that the 
remaining profile must be a Nash equilibrium. Suppose $(s_1^*, \ldots, s_n^*)$ is 
not a Nash equilibrium. Then there exists $s_i'$ such that 
$u_i(s_i^*, s_{-i}^*) < u_i(s_i', s_{-i}^*)$, but $s_i'$ is eliminated by strict 
dominance. Let $s_i''$ be the strategy that eliminates $s_i'$. If 
$s_i'' = s_i^*$, we have a contradiction. Assume that $s_i'' \ne s_i^*$. Then 
$u_i(s_i', s_{-i}^*) < u_i(s_i'', s_{-i}^*)$ since $s_i''$ eliminates $s_i'$. We 
can continue this process no more than a finite number of times until 
$s_i'' = s_i^*$. □ 


In all of our examples, there is at least one Nash equilibrium. However, 
it is quite possible that there will be no strategy profiles satisfying 
the requirements of Nash equilibria. To see this consider the following 
game. Suppose that there are two armies, a defending army (D) and 
an invader (I). The invading army must decide whether to invade 
through the mountains M or to come through the plains P. Similarly, 
the defenders must decide whether to fortify their defenses in the 
mountains or in the plains. If the invader attacks an undefended area it 
wins a payoff of 1; however, it loses 1 if it attacks a fortification. 
Similarly, the defenders get 1 by correctly predicting the direction of the 
attack and lose 1 otherwise. Thus, this normal form game, known as 
Colonel Blotto, can be represented by the following matrix. 
Table 5.8: Colonel Blotto 

D\I | M    | P 
M   | 1,-1 | -1,1 
P   | -1,1 | 1,-1 


Note that $b_D(M) = \{M\}$, $b_D(P) = \{P\}$, $b_I(M) = \{P\}$, and 
$b_I(P) = \{M\}$. A Nash equilibrium requires that there exist $\{s_D, s_I\}$ 
such that $s_I \in b_I(s_D)$ and $s_D \in b_D(s_I)$. Note that it is impossible for 
the best response correspondences of this game to satisfy these conditions. 
For any pair of strategies, one agent will have an incentive to 
choose a different one. Absent a Nash equilibrium (or any restrictions 
imposed by dominance), we lack a prediction for how this game will be 
played. 
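
The absence of a pure strategy equilibrium can be confirmed by computing the best response correspondences directly and looking for a profile in which each strategy is a best response to the other. This is a sketch under our own naming assumptions (best_responses, and player indices 0 for the defender and 1 for the invader); the payoffs are those of Table 5.8. 

```python
def best_responses(payoffs, player, opponent_choice, own_strategies):
    """Set of strategies maximizing `player`'s payoff against a fixed opponent choice.

    `payoffs` maps (defender_choice, invader_choice) -> (u_D, u_I);
    player 0 is the defender and player 1 is the invader.
    """
    def u(own):
        profile = (own, opponent_choice) if player == 0 else (opponent_choice, own)
        return payoffs[profile][player]

    best = max(u(s) for s in own_strategies)
    return {s for s in own_strategies if u(s) == best}


blotto = {
    ("M", "M"): (1, -1), ("M", "P"): (-1, 1),
    ("P", "M"): (-1, 1), ("P", "P"): (1, -1),
}
S = ("M", "P")

# A pure Nash equilibrium is a mutual best response.
equilibria = [
    (d, i)
    for d in S
    for i in S
    if d in best_responses(blotto, 0, i, S) and i in best_responses(blotto, 1, d, S)
]
print(equilibria)
```

The list of mutual best responses comes back empty: the defender always wants to match the invader, while the invader always wants to mismatch. 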

In applications one generally defines a game and seeks to character- 
ize the set of NE (or some other set of strategy profiles). Accordingly 
in characterizing the equilibrium set, one is interested in existence and 
uniqueness. From the perspective of applied researchers, it is generally 
the case that a unique NE is most desirable as it means that we are 
analyzing a well specified model that makes clean predictions. The 
case of multiple equilibria is less desirable as it may mean that the 
model yields ambiguous predictions. The case of no equilibria may be 
very unsatisfactory as the model makes no predictions. 


3. Application: The Hotelling Model of Political 
Competition 


To provide an additional example of Nash equilibria, we turn to one 
of the most widely used models in political game theory: the Hotelling 
(1929) model of political competition which was extended by Downs 
(1957). Suppose that a small town wants to decide where to build a 
school. Its citizenry are stretched out evenly along a one mile stretch 
of road and would like the school to be built as close to their homes as 
possible. Thus, we assume that the voters’ ideal points are distributed 
uniformly over [0,1]. The decision will be made following an election 
where two candidates compete for office by campaigning on promises 
of the school’s location. The winning candidate builds the school at 
the promised location and receives a payoff of 1. The losing candidate 
gets −1. In the case of a tie, we assume that the election is decided 
by a coin toss. To keep the discussion simple, we will assume that 
the voters are not strategic agents in the game, but vote for the closest 
candidate. 
Given our setup, the candidates’ strategy sets are $S_1 = S_2 = [0, 1]$. 
Given that the voters vote for the closest candidate, we can compute 
the vote shares for both candidates for any strategy profile $(s_1, s_2)$. 
Since the voters are distributed uniformly, the number of voters in any 
interval is equal to the width of that interval. So if $s_2 > s_1$, all voters to 
the left of $\frac{s_1 + s_2}{2}$ vote for candidate 1 and her share is thus 
$\frac{s_1 + s_2}{2}$. The remaining $1 - \frac{s_1 + s_2}{2}$ voters vote for 
candidate 2. Conversely, if $s_1 > s_2$, candidate 2 receives a vote share 
$\frac{s_1 + s_2}{2}$ while candidate 1 gets the rest. 
Given the candidates’ utilities for winning, their payoff functions over 
strategies are: 

\[
u_1(s_1, s_2) =
\begin{cases}
1 & \text{if } s_1 < s_2 \text{ and } \frac{s_1 + s_2}{2} > .5, \text{ or if } s_1 > s_2 \text{ and } \frac{s_1 + s_2}{2} < .5 \\
0 & \text{if } s_1 = s_2 \text{ or } \frac{s_1 + s_2}{2} = .5 \\
-1 & \text{if } s_1 < s_2 \text{ and } \frac{s_1 + s_2}{2} < .5, \text{ or if } s_1 > s_2 \text{ and } \frac{s_1 + s_2}{2} > .5
\end{cases}
\]

and 

\[
u_2(s_1, s_2) = -u_1(s_1, s_2)
\]
Our claim is that the unique Nash equilibrium is $s_1 = s_2 = .5$. To 
demonstrate, we begin by computing the best response correspondences. We 
start with candidate 1. Suppose that $s_2 < .5$. Then candidate 1 can win 
for sure by choosing any platform that generates a vote share of more 
than .5. Thus, $b_1(s_2) = (s_2, 1 - s_2)$. For $s_2 > .5$, similar calculations 
produce $b_1(s_2) = (1 - s_2, s_2)$. If $s_2 = .5$, candidate 1 can at best 
generate a tie by choosing .5 as well. Thus, $b_1(.5) = \{.5\}$. Since 
candidate 2’s situation is entirely symmetric, her best response 
correspondence is 

$b_2(s_1) = (s_1, 1 - s_1)$ if $s_1 < .5$ 
$b_2(s_1) = (1 - s_1, s_1)$ if $s_1 > .5$ 
$b_2(s_1) = \{.5\}$ if $s_1 = .5$ 

That $s_1^* = s_2^* = .5$ is a NE follows trivially since $b_1(.5) = \{.5\}$ and 
$b_2(.5) = \{.5\}$. The trick is to show that it is the only one. Suppose 
to the contrary that $s_1^* = s_2^* \ne .5$. However, this cannot be a NE 
since $b_1(s_2^*)$ does not include $s_1^* = s_2^*$. Now suppose $s_1^* < s_2^*$. 
For $s_1^*$ to be a best response to $s_2^*$ while lying below it, we need 
$s_2^* > .5$, and by the same logic $s_1^* < .5$. Then 
$b_1(s_2^*) = (1 - s_2^*, s_2^*)$ while $b_2(s_1^*) = (s_1^*, 1 - s_1^*)$. Putting all 
of these conditions together, a NE requires that $1 - s_2^* < s_1^*$ and 
$s_2^* < 1 - s_1^*$, which imply the contradictory inequalities 
$s_2^* > 1 - s_1^*$ and $s_2^* < 1 - s_1^*$. 
Since the case $s_1^* > s_2^*$ is analogous, we have established the uniqueness of 
$s_1^* = s_2^* = .5$ as a Nash equilibrium. 

A more intuitive proof of our claim follows from our second definition 
of Nash equilibrium (a strategy profile for which no agent has 
a strict preference to deviate). Clearly, no candidate will defect from 
$s_1^* = s_2^* = .5$, since her payoff would fall from 0 (a tie decided by 
a coin toss) to −1. Now consider any other 
possible equilibrium $(s_1^*, s_2^*)$. If one candidate wins in this equilibrium, 
the other candidate can move to .5 and at least generate a tie. Thus, any 
other equilibrium must generate a tie. However, unless $s_1^* = s_2^* = .5$, 
either candidate can move to .5 and win for sure. Thus, $s_1^* = s_2^* = .5$ 
is the only NE. 
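
The uniqueness argument can be illustrated numerically by restricting platforms to a finite grid and testing every pair for profitable deviations. This is only a sketch: the grid of 21 evenly spaced platforms is our own assumption (the model itself has a continuum), and the function names are hypothetical. Exact fractions are used so that ties are detected without rounding error. 

```python
from fractions import Fraction


def u1(s1, s2):
    """Candidate 1's payoff in the win-motivated Hotelling game on [0, 1]."""
    half = Fraction(1, 2)
    if s1 == s2:
        return 0  # identical platforms split the vote
    midpoint = (s1 + s2) / 2
    share1 = midpoint if s1 < s2 else 1 - midpoint
    if share1 == half:
        return 0
    return 1 if share1 > half else -1


def is_nash(s1, s2, grid):
    """No candidate gains by moving to another platform on the grid."""
    no_gain_1 = all(u1(d, s2) <= u1(s1, s2) for d in grid)
    # u2 = -u1, so candidate 2 gains whenever she can lower u1.
    no_gain_2 = all(-u1(s1, d) <= -u1(s1, s2) for d in grid)
    return no_gain_1 and no_gain_2


grid = [Fraction(k, 20) for k in range(21)]  # assumed discretization of [0, 1]
equilibria = [(s1, s2) for s1 in grid for s2 in grid if is_nash(s1, s2, grid)]
print(equilibria)
```

On this grid the only surviving pair is the one with both candidates at one half. A grid check does not replace the proof for the continuum, but it mirrors the argument: at any other pair, a candidate who is not winning can move to the median and improve. 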

Finally, we note that the same outcome is generated by applying
elimination of weakly dominated strategies. Each candidate generates
no worse than a tie by choosing .5. The tie occurs only if the opponent
chooses .5 as well. Against this profile any other platform will
lose. Thus, $s_i = .5$ weakly dominates all other strategies.


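3.1. Vote Maximizing Candidates. Suppose that instead of maximizing
their chance of winning, each candidate maximizes his vote
share. Thus, the payoffs are

$$u_1(s_1, s_2) = \begin{cases} \frac{s_1 + s_2}{2} & \text{if } s_1 < s_2 \\ .5 & \text{if } s_1 = s_2 \\ 1 - \frac{s_1 + s_2}{2} & \text{if } s_1 > s_2 \end{cases}$$

and

$$u_2(s_1, s_2) = 1 - u_1(s_1, s_2).$$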

Our claim is that $s_1^* = s_2^* = .5$ is again the unique Nash equilibrium.
Again, let's start with the best response correspondences. Suppose
that $s_2 < .5$. Candidate 1 would like to choose the smallest platform
greater than $s_2$. However, since the strategy sets are continua, no
such platform exists. Similarly, if $s_2 > .5$, candidate 1 would like to
choose the largest platform less than $s_2$, which does not exist either.
Candidate 2 faces the same situation so that $b_i(s_{-i} \neq .5) = \emptyset$. Now
consider the best response to $s_2 = .5$. Candidate 1 gets .5 for proposing
.5, generating the tie, and a strictly lower vote share for any other
platform. Thus, .5 is the best response. Since candidate 2 faces the
same incentives, $b_i(s_{-i} = .5) = .5$. So clearly $s_1^* = s_2^* = .5$ is a Nash
equilibrium. Since the best response sets for any other strategy pair
are empty, this is also the unique NE.
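The uniqueness claims for office-seeking and vote-maximizing candidates can be checked by brute force on a discretized platform space. The sketch below is only an illustration under stated assumptions: voters are taken to be uniformly distributed on [0, 1], platforms are restricted to a hypothetical grid of eleven points, and the office-seeking payoffs are assumed to be 1 for a win, 0 for a tie and -1 for a loss (any payoffs that rank win above tie above loss give the same equilibria). On a finite grid best responses always exist, so the empty best response sets discussed above do not appear; only the equilibrium set is being checked.

```python
from fractions import Fraction
from itertools import product

GRID = [Fraction(k, 10) for k in range(11)]  # hypothetical platforms 0, 0.1, ..., 1
HALF = Fraction(1, 2)

def vote_share(s1, s2):
    """Candidate 1's vote share when voters are uniform on [0, 1] (assumption)."""
    if s1 == s2:
        return HALF
    midpoint = (s1 + s2) / 2
    return midpoint if s1 < s2 else 1 - midpoint

def win_payoff(s1, s2):
    """Assumed office-seeking payoff: 1 for a win, 0 for a tie, -1 for a loss."""
    share = vote_share(s1, s2)
    return (share > HALF) - (share < HALF)

def pure_nash(payoff1):
    """Return all grid profiles from which neither candidate gains by deviating.

    The game is symmetric, so candidate 2's payoff at (s1, s2) equals
    payoff1(s2, s1).
    """
    equilibria = []
    for s1, s2 in product(GRID, repeat=2):
        best1 = max(payoff1(d, s2) for d in GRID)
        best2 = max(payoff1(d, s1) for d in GRID)
        if payoff1(s1, s2) == best1 and payoff1(s2, s1) == best2:
            equilibria.append((s1, s2))
    return equilibria

print(pure_nash(win_payoff))   # office-seeking candidates
print(pure_nash(vote_share))   # vote-maximizing candidates
```

Under these assumptions both calls return the single profile in which each candidate locates at 1/2.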


3.2. Ideological Candidates. We now consider one last version
of the Hotelling model. In this version, candidates are ideological in
that they care about the policy implemented by the winning candidate.
We will assume that candidate 1 wants the school to be as close to 0
as possible so that her utility from the winning outcome $x$ is $-|x|$.
Similarly, candidate 2 would like the outcome to be as close to 1 as
possible and so gets a utility of $-|1 - x|$ from a school located at $x$.
In the case of a tied election, the voters flip a coin and the winner gets
to implement her platform. Given the candidates' incentives to
move policy to the extremes, it might seem that the outcomes would
no longer be located at the median voter. However, this is not the
case, as we show once again that $s_1^* = s_2^* = .5$ is the unique NE.
First let us specify the payoff functions for each candidate.


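$$u_1(s_1, s_2) = \begin{cases} -|s_1| & \text{if } s_1 < s_2 \text{ and } \frac{s_1 + s_2}{2} > .5 \text{ or if } s_1 > s_2 \text{ and } \frac{s_1 + s_2}{2} < .5 \\ -.5|s_1| - .5|s_2| & \text{if } s_1 = s_2 \text{ or } \frac{s_1 + s_2}{2} = .5 \\ -|s_2| & \text{if } s_1 < s_2 \text{ and } \frac{s_1 + s_2}{2} < .5 \text{ or if } s_1 > s_2 \text{ and } \frac{s_1 + s_2}{2} > .5 \end{cases}$$

and

$$u_2(s_1, s_2) = \begin{cases} -|1 - s_1| & \text{if } s_1 < s_2 \text{ and } \frac{s_1 + s_2}{2} > .5 \text{ or if } s_1 > s_2 \text{ and } \frac{s_1 + s_2}{2} < .5 \\ -.5|1 - s_1| - .5|1 - s_2| & \text{if } s_1 = s_2 \text{ or } \frac{s_1 + s_2}{2} = .5 \\ -|1 - s_2| & \text{if } s_1 < s_2 \text{ and } \frac{s_1 + s_2}{2} < .5 \text{ or if } s_1 > s_2 \text{ and } \frac{s_1 + s_2}{2} > .5 \end{cases}$$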


Now consider the best response functions. We begin with candidate 1.
If $s_2 < .5$, no proposal less than $s_2$ defeats $s_2$. Thus, candidate 1's best
response is to choose $s_2$ or a proposal that loses to $s_2$. This implies that
$b_1(s_2 < .5) = \{s_2\} \cup (1 - s_2, 1]$. Alternatively, if $s_2 > .5$, candidate 1
will choose the smallest platform that defeats $s_2$. However, just as in
the last section, such a platform does not exist so that $b_1(s_2 > .5) = \emptyset$.
By similar arguments, $b_2(s_1 > .5) = \{s_1\} \cup [0, 1 - s_1)$ and
$b_2(s_1 < .5) = \emptyset$. Finally, consider the best response to $s_i = .5$. Any
other proposal by the other candidate loses for sure, generating a utility
of $-.5$. Responding with .5 leads to a lottery over $s_1 = s_2 = .5$ which has
an expected value of $-.5$ for both candidates. Thus, $b_i(.5) = [0, 1]$.

Given these correspondences, it is easy to see that $s_1^* = s_2^* = .5$ is a
Nash equilibrium since $.5 \in b_i(.5)$ for both candidates. Now we show
uniqueness. It's clear that since $b_1(s_2 > .5) = \emptyset$ and $b_2(s_1 < .5) = \emptyset$
the only other possible candidates for NE are $s_1^* > .5 > s_2^*$. However,
this condition in conjunction with $b_1(s_2 < .5) = \{s_2\} \cup (1 - s_2, 1]$ and
$b_2(s_1 > .5) = \{s_1\} \cup [0, 1 - s_1)$ implies that $s_1^* > 1 - s_2^*$ and
$s_2^* < 1 - s_1^*$. Since these last two inequalities are inconsistent,
$s_1^* = s_2^* = .5$ is the only possible NE.


4. Existence of Nash Equilibria 


As we saw in our Colonel Blotto example, there is no guarantee
that a Nash equilibrium in pure strategies will exist. In this section,
we explore conditions that guarantee the existence of Nash equilibria.

First we consider a set of sufficient conditions for equilibria to exist
in pure strategies. Before doing so we need an additional definition to
describe an important property of payoff functions: quasi-concavity.


DEFINITION 5.8. A function $f(x): X \to \mathbb{R}^1$ with $X$ a convex set is
strictly quasi-concave if for any $t \in \mathbb{R}^1$, $x \neq y \in X$ and $\lambda \in (0,1)$ with
$f(x) \geq t$ and $f(y) \geq t$ it is the case that $f(\lambda x + (1 - \lambda) y) > t$.

Alternatively, a function is strictly quasi-concave if its upper contour
sets are strictly convex. We have already shown that strictly convex
preferences have singleton maximal sets (when the sets are nonempty).
This means that if a game has convex $S$ and utility functions $u_i(s_i, s_{-i})$
that are strictly quasi-concave in $s_i$ for each $s_{-i} \in S_{-i}$, the best response
correspondence will be a function (i.e., a single-valued correspondence).
This feature of the best response correspondences guarantees the existence
of a NE. In the next subsection we prove the following result.


THEOREM 5.3. If the normal form game $(N, S, u)$ satisfies the following
conditions:

(1) $S_i$ is a convex and compact subset of a Euclidean space for each
$i \in N$,

(2) $u_i(s_i, s_{-i}) : S \to \mathbb{R}^1$ is a continuous function for each $i \in N$,

(3) for every $i \in N$ and every $s'_{-i} \in S_{-i}$ the function $u_i(s_i, s'_{-i}) :
S_i \to \mathbb{R}^1$ is strictly quasi-concave,

then a Nash equilibrium exists.


As useful as the previous result is, it is obviously restrictive in that
many games will not satisfy its assumptions. An alternative approach
is to consider mixed strategies. A mixed strategy is a randomization
over the pure strategies. We will denote a mixed strategy as $\sigma_i$ and
use $\sigma_i(s_i)$ to denote the probability that agent $i$ chooses strategy $s_i$.
The set of mixed strategies for player $i$ will be the set of probability
distributions over $S_i$, which we denote $\Delta_i = \Delta(S_i)$.

Thus, a game in mixed strategies is very much like a game in pure
strategies except that each agent chooses $\sigma_i \in \Delta_i$ rather than $s_i \in S_i$
and the players evaluate strategies according to expected utility over
the lotteries induced by the mixed strategies. The following is a formal
definition of a game in mixed strategies.


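DEFINITION 5.9. Given a normal form game $\Gamma = (N, S, u)$ the
mixed extension game $\Gamma^m = (N, \Delta, u^m)$ is constructed as follows:
$\Delta_i = \Delta(S_i)$ with an arbitrary strategy $\sigma_i \in \Delta_i$ for all $i \in N$ and
$\Delta = \times_{i \in N} \Delta_i$, with $\sigma_i(s_i)$ denoting the probability that mixed
strategy $\sigma_i$ assigns to pure strategy $s_i$. The expected utility function
$U_i(\sigma_i, \sigma_{-i}) : \Delta \to \mathbb{R}^1$ is defined as

$$U_i(\sigma_i, \sigma_{-i}) = \sum_{s_{-i} \in S_{-i}} \sum_{s_i \in S_i} u_i(s_i, s_{-i}) \, \sigma_i(s_i) \, \sigma_{-i}(s_{-i}) \quad \text{for all } i \in N,$$

where $\sigma_{-i}(s_{-i})$ is the product of the probabilities that the other
players' mixed strategies assign to their components of $s_{-i}$.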

Since the mixed extension of a normal form game is itself a normal 
form game, our definition of NE applies directly to mixed extensions. 


To understand the mechanics of mixed strategy games, recall the
Colonel Blotto game. Suppose now that each side can choose lotteries
over its strategies. So let $\sigma_1 = \sigma_1(M)$ be the probability that the
defender protects the mountains and let $\sigma_2 = \sigma_2(M)$ be the probability
that the invader attacks the mountains. Given these strategies, we can
compute the expected utility for each player for each action.

$$u_1(M, \sigma_2) = \sigma_2 - (1 - \sigma_2) = 2\sigma_2 - 1$$
$$u_1(P, \sigma_2) = -\sigma_2 + (1 - \sigma_2) = 1 - 2\sigma_2$$
$$u_2(\sigma_1, M) = -\sigma_1 + (1 - \sigma_1) = 1 - 2\sigma_1$$
$$u_2(\sigma_1, P) = \sigma_1 - (1 - \sigma_1) = 2\sigma_1 - 1$$

From these payoffs, it is straightforward to compute $b_1(\sigma_2)$ and $b_2(\sigma_1)$.
Note that $u_1(M, \sigma_2) > u_1(P, \sigma_2)$ if $\sigma_2 > \frac{1}{2}$, $u_1(M, \sigma_2) = u_1(P, \sigma_2)$ if
$\sigma_2 = \frac{1}{2}$, and $u_1(M, \sigma_2) < u_1(P, \sigma_2)$ if $\sigma_2 < \frac{1}{2}$. Thus, we can write
the defender's best response to all possible values of the invader's mixed
strategy $\sigma_2$.⁴


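$$b_1(\sigma_2) = \begin{cases} M & \text{if } \sigma_2 > \frac{1}{2} \\ P & \text{if } \sigma_2 < \frac{1}{2} \\ \{M, P\} & \text{if } \sigma_2 = \frac{1}{2} \end{cases}$$

Clearly, since $b_1(\sigma_2) = \{M, P\}$ when $\sigma_2 = \frac{1}{2}$, any randomization over
this set is also a best response. So we may also write $b_1(\sigma_2) = \sigma_1 \in [0, 1]$
if $\sigma_2 = \frac{1}{2}$. Now we can use exactly the same process to compute the
invader's best response function. Note that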


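$$u_2(\sigma_1, M) > u_2(\sigma_1, P) \quad \text{if } \sigma_1 < \tfrac{1}{2}$$
$$u_2(\sigma_1, M) < u_2(\sigma_1, P) \quad \text{if } \sigma_1 > \tfrac{1}{2}$$
$$u_2(\sigma_1, M) = u_2(\sigma_1, P) \quad \text{if } \sigma_1 = \tfrac{1}{2}$$

Thus, the invader's best response correspondence is

$$b_2(\sigma_1) = \begin{cases} M & \text{if } \sigma_1 < \frac{1}{2} \\ P & \text{if } \sigma_1 > \frac{1}{2} \\ \{M, P\} \text{ or } \sigma_2 \in [0, 1] & \text{if } \sigma_1 = \frac{1}{2} \end{cases}$$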


Now we can compute the Nash equilibrium. We know from our previous
discussion that there can be no Nash equilibrium where one agent
plays any of its pure strategies. So the remaining combination to check
is whether there is a combination of $\sigma_1$ and $\sigma_2$ that satisfies Nash's
criteria. Clearly, a mixed $\sigma_1$ is only a best response by the defender if
$\sigma_2 = \frac{1}{2}$ and a mixed $\sigma_2$ is only a best response if $\sigma_1 = \frac{1}{2}$. Thus,
$\sigma_1 = \frac{1}{2}$ and $\sigma_2 = \frac{1}{2}$ is a Nash equilibrium in mixed strategies.

⁴ Note that the pure strategies of plains or mountains are just the special
cases of $\sigma_2 = 0$ and $\sigma_2 = 1$.

For a two player, two strategy game, plotting best response functions
is often useful for finding mixed strategy equilibria. Consider Figure
5.1. The horizontal axis plots the defender's mixed strategy ranging
from $\sigma_1 = 0$ (plains) to $\sigma_1 = 1$ (mountains). The vertical axis plots the
invader's mixed strategy from $\sigma_2 = 0$ (plains) to $\sigma_2 = 1$ (mountains).
The solid line represents the defender's best response to $\sigma_2$ and the
dotted line represents the invader's best response. Note that the only
intersection of these best response curves is at the Nash equilibrium
$\sigma_1 = \frac{1}{2}$ and $\sigma_2 = \frac{1}{2}$.

Insert Figure 5.1 Here 
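The Colonel Blotto equilibrium can also be verified directly from the definition of expected utility in the mixed extension. The following is a minimal sketch: the payoffs of 1 and -1 are read off the expected utility expressions above, the game is treated as zero-sum, and the helper names (`mix`, `expected_utility`, `is_mixed_nash`) are hypothetical. Because expected utility is linear in a player's own mixing probability, it is enough to check deviations to the two pure strategies.

```python
from fractions import Fraction
from itertools import product

ACTIONS = ("M", "P")  # mountains, plains

# Defender's payoffs u1[(defender action, invader action)]; the invader gets the negative.
U1 = {("M", "M"): 1, ("M", "P"): -1, ("P", "M"): -1, ("P", "P"): 1}
U2 = {profile: -value for profile, value in U1.items()}

def mix(prob_mountains):
    """Mixed strategy as a dict from action to probability."""
    return {"M": prob_mountains, "P": 1 - prob_mountains}

def expected_utility(u, sigma1, sigma2):
    """Sum of u(s1, s2) * sigma1(s1) * sigma2(s2) over all pure profiles."""
    return sum(u[(a1, a2)] * sigma1[a1] * sigma2[a2]
               for a1, a2 in product(ACTIONS, repeat=2))

def is_mixed_nash(p1, p2):
    """True if no pure-strategy deviation raises either player's expected utility."""
    s1, s2 = mix(p1), mix(p2)
    v1 = expected_utility(U1, s1, s2)
    v2 = expected_utility(U2, s1, s2)
    no_gain_1 = all(expected_utility(U1, mix(d), s2) <= v1 for d in (0, 1))
    no_gain_2 = all(expected_utility(U2, s1, mix(d)) <= v2 for d in (0, 1))
    return no_gain_1 and no_gain_2

half = Fraction(1, 2)
print(is_mixed_nash(half, half))            # the profile derived above
print(is_mixed_nash(Fraction(3, 4), half))  # defender leans toward the mountains
```

The second call fails the check because an invader facing a defender who favors the mountains strictly prefers to attack the plains.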

Mixed strategy equilibria do not exist only in games without pure
strategy equilibria. To see this, let's return to the two-agency Terrorist
Hunt game represented in Table 5.2. Now let $\sigma_1$ be the probability
that the FBI hunts the kingpin and $\sigma_2$ be the probability the CIA does.
Thus, we can compute the expected utilities for each agency for each
action.

$$u_1(\text{kingpin}, \sigma_2) = 2\sigma_2$$
$$u_1(\text{operative}, \sigma_2) = 1$$
$$u_2(\sigma_1, \text{kingpin}) = 2\sigma_1$$
$$u_2(\sigma_1, \text{operative}) = 1$$
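As before we can compare these utilities to generate the best response
functions.

$$b_1(\sigma_2) = \begin{cases} \text{kingpin} & \text{if } \sigma_2 > \frac{1}{2} \\ \text{operative} & \text{if } \sigma_2 < \frac{1}{2} \\ \sigma_1 \in [0, 1] & \text{if } \sigma_2 = \frac{1}{2} \end{cases}$$

$$b_2(\sigma_1) = \begin{cases} \text{kingpin} & \text{if } \sigma_1 > \frac{1}{2} \\ \text{operative} & \text{if } \sigma_1 < \frac{1}{2} \\ \sigma_2 \in [0, 1] & \text{if } \sigma_1 = \frac{1}{2} \end{cases}$$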

Figure 5.2 plots these best response functions as we did for Colonel
Blotto. Now note that there are three intersections $\{\sigma_1, \sigma_2\}$: $\{0, 0\}$, $\{1, 1\}$,
and $\{\frac{1}{2}, \frac{1}{2}\}$. Of course the first two correspond to the pure strategy
equilibria we have already computed, but the third is an additional
mixed strategy equilibrium.


Insert Figure 5.2 Here 


A feature of mixed strategy equilibria is that the probability that
an agent plays a particular strategy is not a function of her preferences
but those of her opponents. This is because an agent will only play
mixed strategies if she is indifferent between a set of pure strategies.
Thus, the opponent must be choosing its own mixed strategy to ensure
that the agent is indeed indifferent. Thus, it is the preferences of the
opponent that determine the mixing probabilities. Occasionally, this
leads to counter-intuitive predictions. For example, suppose that we
modified the Terrorist Hunt so that the CIA received a much higher
payoff than the FBI for capturing the kingpin. We modify the payoffs
as follows.


Table 5.9: Modified Terrorist Hunt
                        CIA
                  Kingpin   Operative
FBI  Kingpin        2,4        0,1
     Operative      1,0        1,1


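A naive prediction would be that since the CIA gets a higher payoff
from the kingpin, it will be more likely to hunt him. This prediction
is wrong. The reader can check that the best response functions are
now:

$$b_1(\sigma_2) = \begin{cases} \text{kingpin} & \text{if } \sigma_2 > \frac{1}{2} \\ \text{operative} & \text{if } \sigma_2 < \frac{1}{2} \\ \sigma_1 \in [0, 1] & \text{if } \sigma_2 = \frac{1}{2} \end{cases}$$

$$b_2(\sigma_1) = \begin{cases} \text{kingpin} & \text{if } \sigma_1 > \frac{1}{4} \\ \text{operative} & \text{if } \sigma_1 < \frac{1}{4} \\ \sigma_2 \in [0, 1] & \text{if } \sigma_1 = \frac{1}{4} \end{cases}$$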
Figure 5.3 plots these best responses and reveals that the mixed strategy
equilibrium is now $\{\frac{1}{4}, \frac{1}{2}\}$. Thus, the change in the CIA's preferences
did not lead it to hunt the kingpin with a higher probability;
it still hunts him $\frac{1}{2}$ of the time. However, the change actually
decreases the likelihood that the FBI will hunt the kingpin. Thus, the
probability that the kingpin will get caught in the mixed strategy
equilibrium goes down. The logic behind this result is straightforward.
In the mixed strategy equilibrium, the CIA must choose $\sigma_2$ to make the
FBI indifferent between hunting the kingpin and the operative. Since
the FBI's preferences did not change, $\sigma_2$ does not change. Since the
FBI chooses $\sigma_1$ to make the CIA indifferent, an increased utility for the
kingpin means that the FBI must lower the probability of searching for
the kingpin to maintain this indifference.


Insert Figure 5.3 Here 
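The indifference logic can be made concrete with a few lines of code. The sketch below solves for the interior mixed equilibrium of a two-by-two game by choosing each player's mixing probability to equalize the other player's expected payoffs. The function name is hypothetical, the original Terrorist Hunt payoffs are read off the expected utilities computed earlier, and the formula assumes an interior mixed equilibrium exists (the denominators are nonzero).

```python
from fractions import Fraction

def interior_mixed_equilibrium(u1, u2):
    """Mixing probabilities on each player's first action in a 2x2 game.

    u1[r][c] and u2[r][c] are the row and column player's payoffs.
    Each probability is chosen to make the *other* player indifferent.
    Assumes an interior mixed equilibrium exists (nonzero denominators).
    """
    # Row player's mix sigma1 equalizes the column player's two expected payoffs.
    sigma1 = Fraction(u2[1][1] - u2[1][0],
                      u2[0][0] - u2[0][1] - u2[1][0] + u2[1][1])
    # Column player's mix sigma2 equalizes the row player's two expected payoffs.
    sigma2 = Fraction(u1[1][1] - u1[0][1],
                      u1[0][0] - u1[0][1] - u1[1][0] + u1[1][1])
    return sigma1, sigma2

# Rows: FBI hunts kingpin / operative. Columns: CIA hunts kingpin / operative.
fbi = [[2, 0], [1, 1]]
cia_original = [[2, 1], [0, 1]]
cia_modified = [[4, 1], [0, 1]]   # CIA's kingpin payoff raised from 2 to 4

print(interior_mixed_equilibrium(fbi, cia_original))
print(interior_mixed_equilibrium(fbi, cia_modified))
```

Only the FBI's probability moves when the CIA's payoff changes, because the FBI's mix is pinned down by the CIA's payoffs and vice versa.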


While mixed strategy equilibria do have some undesirable properties,
they are guaranteed to exist in games with finite strategy sets as
per the following theorem.

THEOREM 5.4. (Nash) Given a normal form game $\Gamma = (N, S, u)$
in which $S$ is finite, the mixed extension $\Gamma^m = (N, \Delta, u^m)$ has at least
one NE. In other words, every finite game has a mixed strategy NE.


The proof of this theorem (just as the last) utilizes some advanced
mathematics (fixed point theorems) and is thus relegated to the
advanced section below.


4.1. Dominance and Mixed Strategies. Now that we have defined
mixed strategies, we can extend our definition of dominance to
include mixed strategies.


DEFINITION 5.10. A pure strategy $s_i \in S_i$ is strictly dominated if
there exists a $\sigma'_i \in \Delta(S_i)$ such that

$$U_i(\sigma'_i, s_{-i}) > u_i(s_i, s_{-i}) \quad \text{for every } s_{-i} \in S_{-i}.$$

The strategy $s_i$ is weakly dominated if there is a $\sigma'_i$ for which the
inequality holds weakly for every $s_{-i} \in S_{-i}$ and strictly for some
$s'_{-i} \in S_{-i}$.⁵


The extension is straightforward in that we now allow strategies to
be dominated by mixed strategies. As the following example shows, this
extended definition of dominance is much stronger than dominance in
pure strategies, as some strategies may be dominated by mixed strategies
which are not dominated by pure strategies.⁶ To see how a mixture
may dominate a strategy undominated by pure strategies, consider the
following game:


Table 5.10


Note that neither player has any strategies which are dominated
by pure strategies. However, consider a mixed strategy by player 2 of
$\sigma_2(L) = \frac{1}{2}$ and $\sigma_2(R) = \frac{1}{2}$. This mixture has an expected value of 2.5
when played against both $U$ and $D$. This is a higher utility than player
2's pure strategy of $M$. Having extended the definition of dominance,
we can now state the following relationship between mixed strategy
equilibria and iterated dominance.


⁵ Recall that $U_i(\sigma'_i, s_{-i}) = \sum_{s_i \in S_i} u_i(s_i, s_{-i}) \, \sigma'_i(s_i)$.

⁶ Obviously, the converse cannot be true as any strategy dominated by a pure
strategy is dominated by a mixed strategy placing probability one on the dominating
strategy.

THEOREM 5.5. Given a finite normal form game with a mixed strategy
NE $\sigma^*$, if the strategy $s_i$ is played with positive probability under $\sigma^*$
then it survives iterated deletion of strictly dominated strategies.


We leave the proof as an exercise. This theorem can be quite 
useful in computing mixed strategy equilibria. Instead of computing 
best responses to lotteries over all strategies, we need only compute 
those for mixtures of strategies surviving iterated dominance. 


4.2. Calculating Nash Equilibria. Even though we can specify
sufficient conditions for the existence of NE, actually computing the
equilibrium of a game is sometimes more art than science. In fact,
there are games such as chess which we know have equilibria but for
which the actual equilibrium strategies have never been calculated.
Nevertheless, there are a few tricks and algorithms which can facilitate
computation.

4.2.1. Pure Strategy Nash Equilibria in Finite Games. We begin by
outlining the process for checking whether a given profile is an
equilibrium. Given a finite game, one way to characterize all of the pure
strategy NE is to test whether each profile $s' \in S$ is a Nash equilibrium.
To do this one starts with a profile $s' = (s'_1, \ldots, s'_n)$ and asks the
following sequence of questions:

1. Holding $s'_2, \ldots, s'_n$ fixed, is there a strategy $s''_1$ for which
$u_1(s''_1, s'_{-1}) > u_1(s'_1, s'_{-1})$? If so then $s'$ is not a Nash equilibrium. If not
then continue.

2. Holding $s'_1, s'_3, \ldots, s'_n$ fixed, is there a strategy $s''_2$ for which
$u_2(s''_2, s'_{-2}) > u_2(s'_2, s'_{-2})$? If so then $s'$ is not a Nash equilibrium. If not
then continue.

...

i. Holding $s'_1, \ldots, s'_{i-1}, s'_{i+1}, \ldots, s'_n$ fixed, is there a strategy $s''_i$ for which
$u_i(s''_i, s'_{-i}) > u_i(s'_i, s'_{-i})$? If so then $s'$ is not a Nash equilibrium. If not
then continue.

...

n. Holding $s'_1, \ldots, s'_{n-1}$ fixed, is there a strategy $s''_n$ for which
$u_n(s''_n, s'_{-n}) > u_n(s'_n, s'_{-n})$? If so then $s'$ is not a Nash equilibrium. If not
then $s'$ is a Nash equilibrium.

This algorithm is then repeated for each profile in $S$.
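The sequence of questions above translates almost line for line into a brute-force search. The sketch below is one possible rendering, not a canonical implementation: the function name and the way payoffs are passed in are assumptions made for illustration, and the example payoffs are those of the two-agency Terrorist Hunt as implied by the expected utilities computed earlier.

```python
from itertools import product

def pure_nash_equilibria(strategy_sets, payoff):
    """List every pure profile from which no player has a profitable deviation.

    strategy_sets: one sequence of strategies per player.
    payoff(i, profile): player i's utility at the given profile (a tuple).
    """
    equilibria = []
    for profile in product(*strategy_sets):
        stable = True
        for i, own_strategies in enumerate(strategy_sets):
            current = payoff(i, profile)
            # Hold everyone else fixed and try each alternative for player i.
            for alternative in own_strategies:
                deviation = profile[:i] + (alternative,) + profile[i + 1:]
                if payoff(i, deviation) > current:
                    stable = False
                    break
            if not stable:
                break
        if stable:
            equilibria.append(profile)
    return equilibria

# Two-agency Terrorist Hunt with payoffs listed as (FBI, CIA).
TABLE = {
    ("kingpin", "kingpin"): (2, 2),
    ("kingpin", "operative"): (0, 1),
    ("operative", "kingpin"): (1, 0),
    ("operative", "operative"): (1, 1),
}
actions = ("kingpin", "operative")
print(pure_nash_equilibria([actions, actions], lambda i, s: TABLE[s][i]))
```

The search visits every profile, so its cost grows with the product of the sizes of the strategy sets; it is practical only for small finite games.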

In two player finite games, which are representable by matrices,
the algorithm is particularly straightforward. Start with a profile (i.e.,
a matrix entry) and see whether there is an entry in the same column that
makes the row player better off. If so then the original profile is not
a Nash equilibrium. If not then repeat the exercise interchanging row
and column. Consider the following example:


Table 5.11 


We begin by conjecturing that $(t, l)$ is a pure strategy Nash equilibrium.
Note that player 1 can only affect the row choice. Given
that $s_2 = l$, player 1 chooses between $u_1(t, l) = 5$, $u_1(m, l) = 2$ and
$u_1(b, l) = 5$. Accordingly $b_1(l) = \{t, b\}$. Given $s_1 = t$, player 2 chooses
between utilities of 4, 3 and 2 so $b_2(t) = \{l\}$. Accordingly $(t, l)$ is a
NE. Since $b_2(t)$ has a single element, we know that this is the only pure
strategy NE in which $t$ is played. Recall that $b_1(l) = \{t, b\}$ so $(m, l)$ is
not a NE as player 1 would deviate to either $t$ or $b$ if she anticipated
player 2 selecting $l$. Now we conjecture that $(m, c)$ is a NE and note
that $b_1(c) = \{m\}$ and $b_2(m) = \{c\}$ and thus our conjecture is correct.
Since $r \notin b_2(m)$ we note that the only pure strategy NE in which $m$
is played is $(m, c)$. Now if we conjecture that $(b, l)$ is a NE we will
note that $b_2(b) = \{r\}$ and thus we see that our conjecture is incorrect.
If we conjecture that $(b, c)$ is a NE we note that $b_1(c) = \{m\}$ and so
our conjecture is incorrect. Finally, conjecturing that $(b, r)$ is a NE we
observe that neither player has an incentive to deviate and that the
conjecture is correct. We thus conclude that the set of pure strategy
NE is $\{(t, l); (m, c); (b, r)\}$.


5. Pure Strategy Nash Equilibria in Non-Finite Games* 


In games where the strategy space is not finite, the algorithm
exhibited above will not work. However, sometimes we can use the
techniques of optimization to compute equilibria. If we assume that
the utility functions $u_i(s)$ are twice differentiable, we can use simple
calculus to compute best responses. Since a best response to $s_{-i}$ is the
maximizer of $u_i(s_i, s_{-i})$ over $S_i$, a sufficient condition for $s_i \in b_i(s_{-i})$ is
that

$$\frac{\partial u_i(s_i, s_{-i})}{\partial s_i} = 0 \quad \text{and} \quad \frac{\partial^2 u_i(s_i, s_{-i})}{\partial s_i^2} < 0.$$

The first of these conditions is termed the first order condition (FOC)
while the second is known as the second order condition (SOC).⁷ Thus,
when the FOC and SOC hold, we can use the solutions to
$\frac{\partial u_i(s_i, s_{-i})}{\partial s_i} = 0$ to solve for each of the best response functions.
Then a Nash equilibrium is the solution to the system

$$\begin{aligned} s_1^* &= b_1(s_{-1}^*) \\ &\;\;\vdots \\ s_n^* &= b_n(s_{-n}^*) \end{aligned}$$

However, this procedure can be unnecessarily cumbersome as it
requires solving for each best response function as well as the system of
equations. Often it is more convenient to solve the system of first
order conditions directly:

$$\begin{aligned} \frac{\partial u_1(s_1, s_{-1})}{\partial s_1} &= 0 \\ &\;\;\vdots \\ \frac{\partial u_i(s_i, s_{-i})}{\partial s_i} &= 0 \\ &\;\;\vdots \\ \frac{\partial u_n(s_n, s_{-n})}{\partial s_n} &= 0 \end{aligned}$$

To guarantee that the solution is an equilibrium, we must check the
second order conditions for each agent at the solution.

If either system of equations has multiple solutions, there is more
than one equilibrium in pure strategies. However, the absence of a
solution does not necessarily imply that there are no pure strategy Nash
equilibria. Since the FOC and SOC are sufficient but not necessary,
there are many games in which the best responses do not satisfy them.
Such situations can arise for a variety of reasons. In cases where
payoffs against $s_{-i}$ are either not quasi-concave or are monotonically
increasing or decreasing over $S_i$, agent $i$'s best response will be at the
boundary of $S_i$. For such best responses, known as "corner solutions",
the FOC will typically not hold. A second situation in which the FOC
approach will not work is when the payoff functions are discontinuous
in $s_{-i}$. Since the payoffs will not be differentiable at the discontinuities,
we cannot compute the FOC or the SOC.


⁷ If $S_i$ has more than one dimension then the term $\frac{\partial u_i(s_i, s_{-i})}{\partial s_i}$ is a vector where
each coordinate is the partial derivative with respect to one coordinate of $s_i$, and the
quantity 0 denotes the vector of 0's. The second order condition is the requirement
that the matrix of second derivatives be negative definite.

6. Application: Interest Group Contributions


To demonstrate this procedure for computing Nash equilibria, we
consider a simple interest group game. Two interest groups $N = \{1, 2\}$
want to influence a government policy. Both groups know that the
final policy will be a function of how much support they give to the
government. The first group's most preferred policy is 0 and the
second group's most preferred policy is 1. The government favors the
policy $\frac{1}{2}$ but may be influenced by the contributions. Each group
can contribute an amount $s_i \in [0, 1]$ and the final policy is given by
$x(s_1, s_2) = \frac{1}{2} - s_1 + s_2$. So both groups contribute to the government
simultaneously and then the government enacts the policy given by the
function $x(s_1, s_2)$. The government keeps all of the contributions to
buy advertisements for the next election.⁸ We assume that the interest
groups each have utility functions over their contribution and the final
policy of the form:


$$u_1(s_1, s_2) = -(x(s_1, s_2))^2 - s_1$$
$$u_2(s_1, s_2) = -(1 - x(s_1, s_2))^2 - s_2$$

Substituting the policy function into the utility functions we obtain:

$$u_1(s_1, s_2) = -\left(\tfrac{1}{2} - s_1 + s_2\right)^2 - s_1$$
$$u_2(s_1, s_2) = -\left(1 - \left(\tfrac{1}{2} - s_1 + s_2\right)\right)^2 - s_2$$

The first order conditions are given by differentiation:

$$FOC_1: \; 2\left(\tfrac{1}{2} - s_1 + s_2\right) - 1 = 0$$
$$FOC_2: \; 2\left(1 - \left(\tfrac{1}{2} - s_1 + s_2\right)\right) - 1 = 0$$

Solving $FOC_1$ yields the best response correspondence

$$b_1(s_2) = s_2.$$

Solving $FOC_2$ yields the best response correspondence

$$b_2(s_1) = s_1.$$

Now the set of pure strategy NE to this game is infinite and it is given
by:

$$\{(s_1, s_2) \in [0, 1]^2 : s_1 = s_2\}$$


⁸ This game is closely related to the all-pay auction in economics.

This result has a very straightforward interpretation. Any pair of
equivalent contributions is a Nash equilibrium. The resulting policy
from such a profile of contributions is $\frac{1}{2}$. No contributor wants to
unilaterally deviate because the marginal gain of an additional unit of
contribution (in terms of pulling policy in the desirable direction) is
exactly offset by the marginal cost of losing another unit of resources.
As in the Prisoner's Dilemma, the Nash equilibria of this game are
inefficient. That is, the contributors would rather commit to not giving
any money to the government. However, since no such commitment
is possible in the game, the fact that each contributor has an incentive
to deviate from such an agreement means that this outcome is not
supportable.
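The best response functions derived from the first order conditions can be cross-checked numerically. The sketch below evaluates each group's utility on a hypothetical grid of contributions in steps of 0.01 and picks the maximizer directly, without using any calculus. The grid and the function names are assumptions made for illustration; the policy and utility functions are the ones stated in the text.

```python
def policy(s1, s2):
    """Policy x(s1, s2) = 1/2 - s1 + s2 from the text."""
    return 0.5 - s1 + s2

def u1(s1, s2):
    """Group 1 prefers policy 0 and pays its contribution s1."""
    return -policy(s1, s2) ** 2 - s1

def u2(s1, s2):
    """Group 2 prefers policy 1 and pays its contribution s2."""
    return -(1 - policy(s1, s2)) ** 2 - s2

GRID = [k / 100 for k in range(101)]  # hypothetical grid 0.00, 0.01, ..., 1.00

def best_response_1(s2):
    return max(GRID, key=lambda s1: u1(s1, s2))

def best_response_2(s1):
    return max(GRID, key=lambda s2: u2(s1, s2))

for s in (0.0, 0.25, 0.8):
    print(s, best_response_1(s), best_response_2(s))
```

On this grid each group's maximizer matches the other group's contribution, in line with $b_1(s_2) = s_2$ and $b_2(s_1) = s_1$.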


7. Application: International Externalities 


Suppose that we have two countries who must decide how much to
invest in pollution abatement. The strategies are levels of investment
$s_1 \geq 0$ and $s_2 \geq 0$. Each country pays a cost $c(s_i) = k_i s_i$. We will assume
that $k_1 < k_2$ so that country 1 can lower pollution a given amount at a
lower cost than country 2. The benefits of the investments are based
on the total investment made, so that each country receives $\sqrt{s_1 + s_2}$.
Therefore, the payoffs for each country are

$$u_i(s_1, s_2) = \sqrt{s_1 + s_2} - k_i s_i.$$

Given these payoffs it is straightforward to get the first order conditions:

$$\tfrac{1}{2}(s_1 + s_2)^{-1/2} - k_1 = 0$$
$$\tfrac{1}{2}(s_1 + s_2)^{-1/2} - k_2 = 0$$

while the SOCs are both $-\frac{1}{4}(s_1 + s_2)^{-3/2} < 0$. However, note that
there can be no solution to the system of FOCs since $k_1 < k_2$. Suppose
that $s_1 > (2k_2)^{-2}$. Then the left hand side of country 2's FOC is always
negative. This implies that country 2's payoffs are always decreasing
in its investment. This is shown graphically in Figure 5.4 which plots
country 2's payoffs as a function of country 1's investment. Since all
investments are assumed to be non-negative, country 2's best response
to this level of country 1 investment is to invest zero. Similarly, if
$s_2 > (2k_1)^{-2}$, country 1's best response is zero investment. Since these
"corner solutions" are part of each country's best response function,
we may not use the FOC approach.


Insert Figure 5.4 Here

It is easy to see the actual solution graphically. Figure 5.5 plots
the best response functions for both countries. The vertical axis
represents both country 2's strategy and its best response to country 1.
Similarly, the horizontal axis represents country 1's strategy and best
response. The dotted line plots country 2's best response function,
which is declining in $s_1$ until $s_1 = (2k_2)^{-2}$, after which it is zero. The
solid line is country 1's best response, which also declines to zero at
$s_2 = (2k_1)^{-2}$. Clearly, the only intersection of the best responses is at
the point where $s_1 = (2k_1)^{-2}$ and $s_2 = 0$.

Insert Figure 5.5 Here 


If we assumed instead of $k_1 < k_2$ that country 2 is the low
cost country, it is easy to check that the Nash equilibrium strategies
would be $s_1 = 0$ and $s_2 = (2k_2)^{-2}$. Thus, the unique Nash equilibrium
of this game is one where the high cost country free rides completely
on the low cost country. The low cost country chooses its optimal
investment knowing that the high cost country will invest nothing.
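The corner solution can be illustrated with a short numerical check. In the sketch below each country's best response is written as the larger of zero and the gap between its preferred total investment, $(2k_i)^{-2}$, and what the other country already supplies. The cost values are hypothetical and chosen only so that $k_1 < k_2$; the function names are likewise assumptions.

```python
def best_response(k_own, s_other):
    """Investment maximizing sqrt(s_own + s_other) - k_own * s_own over s_own >= 0.

    The interior first order condition gives a target total investment of
    (2 * k_own) ** -2; if the other country already supplies that much, the
    best response is the corner solution of zero.
    """
    target_total = (2 * k_own) ** -2
    return max(0.0, target_total - s_other)

def is_nash(s1, s2, k1, k2, tol=1e-12):
    """Check that each investment is a best response to the other."""
    return (abs(best_response(k1, s2) - s1) < tol
            and abs(best_response(k2, s1) - s2) < tol)

k1, k2 = 0.5, 1.0            # hypothetical costs with k1 < k2
s1_star = (2 * k1) ** -2     # low-cost country supplies everything
print(s1_star, is_nash(s1_star, 0.0, k1, k2))
print(is_nash(0.0, (2 * k2) ** -2, k1, k2))   # high-cost country investing alone
```

With these costs the profile in which country 1 invests $(2k_1)^{-2}$ and country 2 invests nothing passes the check, while the reverse assignment does not, since country 1 would want to top up the total.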


8. Computing Equilibria with Constrained Optimization* 


A useful alternative when a game may have corner best response
functions is to use the techniques of constrained maximization such
as Kuhn-Tucker programming to compute necessary conditions for Nash
equilibria. To illustrate, consider the externality game where we
impose the constraint that $s_1 \geq 0$ and $s_2 \geq 0$. We can formally
incorporate these constraints with Lagrange multipliers $\lambda_1$ and $\lambda_2$ so that
agent $i$ chooses $s_i$ to maximize

$$\sqrt{s_1 + s_2} - k_i s_i + \lambda_i s_i.$$

Now the necessary conditions for a Nash equilibrium are the first order
conditions

$$\tfrac{1}{2}(s_1 + s_2)^{-1/2} - k_1 + \lambda_1 = 0$$
$$\tfrac{1}{2}(s_1 + s_2)^{-1/2} - k_2 + \lambda_2 = 0$$

along with the "slackness" conditions

$$\lambda_1 s_1 = 0$$
$$\lambda_2 s_2 = 0$$

and the constraints:

$$\lambda_i \geq 0 \text{ for all } i$$
$$s_i \geq 0 \text{ for all } i.$$

Thus, a Nash equilibrium is a solution to the four equations that satisfies
the constraints. Note that the first two equations imply that
$\lambda_i = k_i - \frac{1}{2}(s_1 + s_2)^{-1/2}$. We can rewrite the slack conditions as

$$\left[k_1 - \tfrac{1}{2}(s_1 + s_2)^{-1/2}\right] s_1 = 0$$
$$\left[k_2 - \tfrac{1}{2}(s_1 + s_2)^{-1/2}\right] s_2 = 0$$

and the constraints on $\lambda$ as

$$k_1 \geq \tfrac{1}{2}(s_1 + s_2)^{-1/2}$$
$$k_2 \geq \tfrac{1}{2}(s_1 + s_2)^{-1/2}$$

It is easy to see that there can be no solution in which both countries are
making positive investments because this would require the bracketed
terms of each slackness condition to be zero, which they cannot both be
since $k_1 < k_2$. It is also easy to see that $s_1 = s_2 = 0$ is not an equilibrium
because the requirement that $k_i \geq \frac{1}{2}(s_1 + s_2)^{-1/2}$ would be violated.
Suppose that $s_1 = 0$ and $s_2 > 0$. Then the second slackness condition
implies that $k_2 = \frac{1}{2}(s_1 + s_2)^{-1/2} > k_1$, which violates the non-negativity
constraint on $\lambda_1$. Thus, the only possible NE involves $s_1 > 0$ and
$s_2 = 0$. The first FOC implies that $s_1^* = (2k_1)^{-2}$, which is exactly the
result we derived in the last section.
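The case analysis above can be mirrored in code by recovering the multipliers from the first order conditions and then testing non-negativity and complementary slackness at a candidate profile. This is a minimal sketch under stated assumptions: the function name, the tolerance and the cost values are hypothetical, and the check covers only the necessary conditions listed in the text.

```python
import math

def kkt_holds(s1, s2, k1, k2, tol=1e-9):
    """Check the Kuhn-Tucker conditions of the externality game at (s1, s2).

    The multipliers are recovered from the first order conditions,
    lambda_i = k_i - 0.5 * (s1 + s2) ** -0.5, and then tested against
    non-negativity and complementary slackness.
    """
    if s1 < 0 or s2 < 0 or s1 + s2 == 0:
        return False  # marginal benefit is unbounded at zero total investment
    marginal_benefit = 0.5 / math.sqrt(s1 + s2)
    lam1 = k1 - marginal_benefit
    lam2 = k2 - marginal_benefit
    nonnegative = lam1 >= -tol and lam2 >= -tol
    slack = abs(lam1 * s1) <= tol and abs(lam2 * s2) <= tol
    return nonnegative and slack

k1, k2 = 0.5, 1.0  # hypothetical costs with k1 < k2
print(kkt_holds((2 * k1) ** -2, 0.0, k1, k2))   # s1 > 0, s2 = 0
print(kkt_holds(0.0, (2 * k2) ** -2, k1, k2))   # s1 = 0, s2 > 0
print(kkt_holds(0.1, 0.1, k1, k2))              # both positive
```

Only the first candidate satisfies all of the conditions; the second fails because the implied $\lambda_1$ is negative, which is the violation identified in the text.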


9. Proving the Existence of Nash Equilibria** 


In this section, we provide a rigorous proof of Nash's Theorem.
But first we need to develop some additional mathematical ideas. The
most important is that of a fixed point. Intuitively, a fixed point of a
function (correspondence) is a point (set) in the domain that maps into
itself in the range. Formally, given a correspondence $c : A \rightrightarrows A$, a
fixed point $x^* \in A$ is a point such that $x^* \in c(x^*)$. If $c(\cdot)$ is a function
then a fixed point is a point $x^*$ such that $x^* = c(x^*)$. Note that since
a Nash equilibrium is a strategy profile $s^*$ for which $s_i^* \in b_i(s_{-i}^*)$ for
every $i \in N$, it is a fixed point of the best response correspondence

$$b(s) = (b_1(s_{-1}), \ldots, b_i(s_{-i}), \ldots, b_n(s_{-n})).$$

Thus, proving the existence of a Nash equilibrium is simply a matter of
determining whether or not a fixed point exists for a given best response
correspondence. Fortunately, there is a large body of mathematics
dedicated to determining the sufficient conditions for the existence of
a fixed point. A specification of such conditions is known as a fixed
point theorem. Thus, if we can demonstrate that the properties of
the best response correspondence for a game match the conditions of
an established fixed point theorem, we have proven the existence of a
Nash equilibrium.

Before proceeding, we need to elaborate some of the conditions that
will be required for the existence of a fixed point. The first is that we
require that the correspondence be convex valued.


DEFINITION 5.11. A correspondence $c : A \rightrightarrows A$ is convex valued
if for every $a \in A$, $c(a)$ is a convex subset of $A$.


In other words a correspondence $c(\cdot)$ is convex valued if for every
$x \in A$, if $y, z \in c(x)$ then for any $\lambda \in [0, 1]$ the point $\lambda y + (1 - \lambda) z \in c(x)$.
Now we define the upper inverse of a set $B$, which is the set of points
in the domain which the correspondence maps into subsets of $B$.


DEFINITION 5.12. For a correspondence $c : A \rightrightarrows A$, the upper
inverse of a set $B \subset A$ is $c^+(B) = \{x \in A : c(x) \subset B\}$.


We can now define upper hemi-continuity, an important property for
establishing the existence of a fixed point.


DEFINITION 5.13. A correspondence $c : A \rightrightarrows A$ is upper
hemi-continuous if for every open set $O \subset A$ the set $c^+(O)$ is open.


Definition 5.13 is often hard to verify. The existence of a closed
graph is easier to check.


DEFINITION 5.14. A correspondence $c : A \rightrightarrows A$ has a closed
graph if for any two sequences $x^n \to x \in A$ and $y^n \to y \in A$ with
$x^n \in A$ and $y^n \in c(x^n)$ for every $n$ we have $y \in c(x)$.


The following theorem (whose proof we leave as an exercise)
establishes that correspondences with closed graphs are upper
hemi-continuous.


THEOREM 5.6. If $A$ is compact, a correspondence $c : A \rightrightarrows A$ is
upper hemi-continuous if it has a closed graph.


The intuition behind the closed graph condition is not difficult to
see. When a correspondence has a closed graph, if we have two
sequences $x^n$ and $y^n$ of points each in $A$ that converge to $x$ and $y$ both
in $A$ with $y^n \in c(x^n)$, it must be the case that $y \in c(x)$. In other words
for any sequence in the domain converging to $x$ and any selection of
points $y^n$ that are in the image converging to a point $y$, it must be the
case that the limit $y$ is in the image of the correspondence evaluated
at the limit $x$. Graphically, for a correspondence that has a closed
graph, the set $\{(x, y) \in A^2 : y \in c(x)\}$ is closed in the space $A^2$. For
a more complete treatment of these concepts see Border (1989).

It is not difficult to see that if the correspondence $c : A \rightrightarrows A$ is
single-valued for every $a \in A$ then $c(\cdot)$ is a function. If a single-valued
correspondence is upper hemi-continuous then it is also a continuous
function.

To establish Theorem 5.3, we will use the following fixed point
theorem.


THEOREM 5.7. (Brouwer) Suppose $A \subset \mathbb{R}^d$ is a compact and convex
set. If $f : A \to A$ is a continuous function then $f(\cdot)$ has a fixed point
in $A$.


To establish Theorem 5.4, we will use the following fixed point 
theorem. 


THEOREM 5.8. (Kakutani) Suppose that $A \subset \mathbb{R}^d$ is a compact and
convex set with $c : A \rightrightarrows A$ a correspondence satisfying the conditions:

(1) $c(x)$ is non-empty for every $x \in A$

(2) $c(\cdot)$ is convex valued

(3) $c(\cdot)$ is upper hemi-continuous

then $c(\cdot)$ has a fixed point in $A$.


Several proofs of these results appear in Border. In order to es- 
tablish the existence of Nash equilibria in either mixed strategies or 
pure strategies when the appropriate assumptions are satisfied, we will 
need to show that in the case of Theorem 5.7 b(s) is a continuous 
function and in the case of Theorem 5.8 b(s) is a correspondence that 
satisfies the conditions 1-3. A result that is useful in its own right, 
as well as helpful in demonstrating that b(s) is non-empty and upper 
hemi-continuous is the Theorem of the Maximum. 


THEOREM 5.9. (Theorem of the Maximum) Let $X \subset \mathbb{R}^d$, $M \subset \mathbb{R}^l$
be compact and convex sets. Let $f(x,m) : X \times M \to \mathbb{R}^1$ be continuous
in $x$ and $m$; then the correspondence $c : M \rightrightarrows X$ defined as

$$c(m) = \arg\max_{x \in X} \{ f(x,m) \}$$

is non-empty and upper hemi-continuous.


The fact that the set of optimal choices is non-empty is interesting
on its own. This result was stated in a previous section. The fact
that the correspondence $c(\cdot)$ defined in the theorem is upper
hemi-continuous has the following interpretation. Calling the vector $m$ a
parameter vector of the optimization problem, if we consider a sequence
of parameter vectors $m^n$ converging to $m$ then for any selection of
optimal policies $x^n \in c(m^n)$ that converge to $x$ it will be the case
that $x \in c(m)$.

PROOF OF THEOREM 5.3. Assume that $S_i$ is a convex subset of
$\mathbb{R}^d$ for each $i \in N$, that for each $i \in N$, $u_i(s) : S \to \mathbb{R}^1$ is
continuous, and that for each $s'_{-i} \in S_{-i}$, $u_i(s_i, s'_{-i})$ is strictly
quasiconcave in $s_i$. By the Theorem of the Maximum the correspondence
$b_i(s_{-i}) : S_{-i} \rightrightarrows S_i$ defined as

$$b_i(s_{-i}) = \arg\max_{s_i \in S_i} \{u_i(s_i, s_{-i})\}$$

is non-empty and upper hemi-continuous. By Theorem 4 of Chapter
2, $b_i(s_{-i})$ is a singleton for every $s_{-i} \in S_{-i}$. The fact that an upper
hemi-continuous correspondence that is single valued is a continuous
function (see Exercise 5.7) implies that $b_i(s_{-i})$ is a continuous function
from $S_{-i}$ into $S_i$ for each $i \in N$. We now construct the function

$$b(s) : S \to S$$

by defining $b(s_1, \ldots, s_n) = (b_1(s_{-1}), \ldots, b_n(s_{-n}))$. Since $s_{-i}(s)$ is a
projection it is continuous. Since $b_i(\cdot)$ is continuous, the composition
of continuous functions is continuous, and the product of continuous
functions is continuous in the product space, the function $b(s)$ is
continuous. By Brouwer’s fixed point theorem this mapping has a fixed
point, $s^* = b(s^*)$. But this means that for every $i \in N$, $b_i(s^*_{-i}) = s^*_i$
and thus $s^*$ is a NE.


The proof of Theorem 5.4 is similar. We will show that for any
finite game $\Gamma$, the best response correspondence in the mixed
extension $\Gamma'$ satisfies the conditions of Kakutani’s fixed point theorem.


PROOF OF THEOREM 5.4. Assume that in the game $\Gamma$, $S_i$ is finite
for each $i \in N$. This implies that in the mixed extension $\Gamma'$, $\Delta_i$ is
a compact and convex subset of a finite dimensional Euclidean space.
By Definition 5.9 we can see that $U(\sigma_i, \sigma_{-i})$ is linear and therefore
continuous in $\sigma$. Letting $b_i(\sigma_{-i}) : \Delta_{-i} \rightrightarrows \Delta_i$ be defined as

$$b_i(\sigma_{-i}) = \arg\max_{\sigma_i \in \Delta_i} \{U(\sigma_i, \sigma_{-i})\}$$

the Theorem of the Maximum implies that this correspondence is
non-empty for every $\sigma_{-i} \in \Delta_{-i}$ and upper hemi-continuous. Since
$U(\sigma_i, \sigma_{-i})$ is linear, for any $\sigma_{-i}$ and any two $\sigma_i', \sigma_i''$, if
$U(\sigma_i', \sigma_{-i}) = U(\sigma_i'', \sigma_{-i})$ we have
$U(\lambda\sigma_i' + (1-\lambda)\sigma_i'', \sigma_{-i}) = U(\sigma_i', \sigma_{-i})$, so $b_i(\sigma_{-i})$ is convex valued.
Combining these facts we see that the correspondence

$$b(\sigma) : \Delta \rightrightarrows \Delta$$

defined as $b(\sigma_1, \ldots, \sigma_n) = (b_1(\sigma_{-1}), \ldots, b_n(\sigma_{-n}))$ satisfies the
requirements of Kakutani’s fixed point theorem. Thus there is a mixed
strategy profile satisfying the condition $\sigma^* \in b(\sigma^*)$. Such a profile is
a NE to $\Gamma'$ and thus a mixed strategy NE to $\Gamma$. $\square$


10. Strategic Complementarity 


Consider two candidates 1 and 2 running for office. Each candidate
selects a level $s_i \geq 0$ of campaign effort. The opportunity cost of this
effort is $\beta_i s_i$ where $\beta_i > 0$. We assume that the election outcome is a
probabilistic function of $s_1$ and $s_2$. Let $\pi(s_1, s_2)$ denote the
probability that candidate 1 wins so that candidate 2 wins with probability
$1 - \pi(s_1, s_2)$. It is reasonable to think that this probability function
is increasing in $s_1$ and decreasing in $s_2$. In formulating a reasonable
model of campaigns, we confront an important question: should
candidate 1’s best response be increasing or decreasing in candidate 2’s
effort? It seems natural to think that candidate 1 will select higher
levels of $s_1$ in response to higher levels of $s_2$ and vice versa. If this
is the case, we say that the two choice variables are strategic
complements. Games with strategic complementarity are among a class of
games known as supermodular and are particularly amenable to
equilibrium and comparative static analysis. In this section we sketch the
intuition behind the analysis of such games using an example. We
leave the technical details for the subsequent section.

Kahn and Kenney’s (1999) study of Senate elections posits that 
competitiveness is an important factor in determining how much me- 
dia coverage campaign activities will generate. The competitiveness 
shapes the way that voters respond to campaigns while campaigns
influence the competitiveness of a race. Absent high competitiveness,
the media and voters tune out and thus the marginal value of
advertisements or speeches is low. On the other hand, when the
competitiveness is high these messages have large effects. Kahn and Kenney’s
theory portrays the media as a mechanism that determines competitiveness
as a function of campaigning. Consistent with this interpretation,
higher levels of campaigning are likely to result in higher states of
competitiveness. In highly competitive races advertisements and speeches
are more influential. The feedback loop that Kahn and Kenney discuss
suggests that candidate effort levels may indeed be strategic
complements. If so, each of the best responses is a strictly increasing
function. Figure 5.6 depicts a game of this form with a unique equilibrium
point $(s_1^*, s_2^*)$.


Insert Figure 5.6


While pictures with multiple equilibria could be drawn, it is not
difficult to see that the equilibria will all be completely ordered such
that if $(s_1^*, s_2^*)$ and $(s_1^{**}, s_2^{**})$ are two Nash equilibria and
$s_1^* > s_1^{**}$ then $s_2^* > s_2^{**}$ (and if $s_2^* > s_2^{**}$ then $s_1^* > s_1^{**}$).
To see that this must be true, try drawing a counterexample while
maintaining the assumption that the best responses are strictly increasing.

We can take Kahn and Kenney’s hypothesis about competitiveness 
further. Suppose at the beginning of a campaign, there is an ex- 
ogenous level of competitiveness. Sources of variation might include 
the importance of office, the media environment, and the attentiveness 
of the voters. So consider two electoral environments where one is
more competitive than the other. For a fixed level of $s_i$, we would
expect that $b_{-i}(s_i)$ is higher in the more competitive race. In Figure
5.6, $b_i'(s_{-i})$ denotes the best responses for the more competitive
election while $b_i(s_{-i})$ represents those of the less competitive election.
The figure demonstrates that the equilibrium campaign efforts of the more
competitive race $(s_1^{**}, s_2^{**})$ are higher than the equilibrium efforts of
the less competitive race, $(s_1^*, s_2^*)$. Thus, we can conclude that
competitiveness generates more campaigning.

It is crucial to note that this comparative static result was generated 
by nothing more than the assumption of strategic complementarity. As 
long as both best responses are increasing, a common upward shift in 
the best responses must lead to a higher intersection point. It is easy 
to see that the effect of competitiveness would be ambiguous if one 
of the players had a downward sloping best response. Furthermore,
as long as candidate efforts are strategic complements, the result does 
not hinge on any further assumptions about functional forms or player 
preferences. It also obviates the need for the continuous and differen- 
tiable best response functions required for use of the implicit function 
theorem. 
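
The comparative static argument can be checked numerically. The sketch below is only an illustration: the linear best responses, their slopes, and the `shift` parameter standing in for exogenous competitiveness are assumptions made for the example, not part of the model in the text. It finds the intersection of two increasing best responses by iteration and compares the equilibria before and after a common upward shift.

```python
def equilibrium(shift, tol=1e-12, max_iter=10_000):
    """Iterate hypothetical increasing best responses until they settle."""
    # Assumed functional forms: slopes below 1 keep the iteration convergent.
    def b1(s2):
        return shift + 0.5 * s2

    def b2(s1):
        return shift + 0.4 * s1

    s1, s2 = 0.0, 0.0
    for _ in range(max_iter):
        new_s1, new_s2 = b1(s2), b2(s1)
        if abs(new_s1 - s1) < tol and abs(new_s2 - s2) < tol:
            return new_s1, new_s2
        s1, s2 = new_s1, new_s2
    raise RuntimeError("best-response iteration did not converge")


less_competitive = equilibrium(shift=1.0)
more_competitive = equilibrium(shift=1.5)
print(less_competitive, more_competitive)
```

Under these assumptions both equilibrium effort levels are higher after the shift, as the argument in the text predicts for any pair of increasing best responses.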

The trick, of course, is to determine what model primitives are 
consistent with complementarity. In the following technical section, we 
present the underlying theory behind supermodular games and present 
some sufficiency conditions for the existence of equilibria exhibiting 
monotone comparative statics. 


11. Supermodularity and Monotone Comparative Statics* 


Recall that Nash’s theorem relies on the continuity of best responses
and compactness of strategy sets so that Kakutani’s fixed point
theorem ensures the existence of equilibria. Unfortunately, these existence
requirements often require very strong assumptions about the
primitives of the model. However, we can use a different fixed point
theorem to establish existence so long as the best responses satisfy certain
monotonicity conditions, such as the ones discussed in the last
section. As a bonus, the comparative statics analysis of the equilibrium set
is immediate when the best responses satisfy these conditions. These
analyses are much simpler and more straightforward than those based on
the implicit function theorem. In this section, we summarize several
results on a class of games known as supermodular games. Of particular
interest are the existence of equilibria with discontinuous best response
correspondences and direct comparative static results.

In principle the concepts of this section can be developed for any
partial ordering, with the goal being the attainment of results about
monotonicity relative to the partial order. To keep things concrete we
will consider only the natural partial ordering $\geq$ on sets contained in
$\mathbb{R}^n$. For any two numbers $x, y \in \mathbb{R}^1$ the join is $x \vee y = \max\{x, y\}$
and the meet is $x \wedge y = \min\{x, y\}$. For any two vectors $x, y$
in $\mathbb{R}^n$ we have $x \vee y = (\max\{x_1, y_1\}, \ldots, \max\{x_n, y_n\})$ and
$x \wedge y = (\min\{x_1, y_1\}, \ldots, \min\{x_n, y_n\})$. A set which contains all of
the joins and meets of its elements is called a lattice.


DEFINITION 5.15. A set $A$ is a lattice if for each $x, y \in A$ we have
$x \vee y \in A$ and $x \wedge y \in A$.
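
On a finite set of points the lattice property can be checked by brute force. The following sketch is an illustration under assumptions of our own (an integer grid standing in for a square, and the grid points with coordinates summing to at most 3 standing in for a triangle); the helper names `join`, `meet`, and `is_lattice` are hypothetical.

```python
from itertools import product


def join(x, y):
    """Componentwise maximum of two vectors."""
    return tuple(max(a, b) for a, b in zip(x, y))


def meet(x, y):
    """Componentwise minimum of two vectors."""
    return tuple(min(a, b) for a, b in zip(x, y))


def is_lattice(points):
    """True if the finite set contains the join and meet of every pair."""
    pts = set(points)
    return all(join(x, y) in pts and meet(x, y) in pts for x in pts for y in pts)


square = list(product(range(4), repeat=2))            # product of two "intervals"
triangle = [(a, b) for a, b in square if a + b <= 3]  # a discretized triangle

print(is_lattice(square), is_lattice(triangle))
```

The triangle fails because, for instance, the join of (3, 0) and (0, 3) is (3, 3), which lies outside the set.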


Note that intervals and the products of intervals are lattices, but
sets like $\{x \in \mathbb{R}^3 : x_1 + x_2 + x_3 \leq 1\}$ are not lattices.$^{9}$ Intuitively,
squares (products of intervals) are lattices but triangles (simplices) are
not.$^{10}$ Typically we are interested in single agent or multi-agent
decision theory problems in which there is a set of choice variables $X$ and a
set of exogenous parameters $P$, both of which are lattices. The
objective function depends on both types of variables, $f(x, p) : X \times P \to \mathbb{R}^1$.
The key condition that we now focus on is called supermodularity.


DEFINITION 5.16. The function $f(\cdot,\cdot) : X \times P \to \mathbb{R}^1$ is supermodular
in $(x, p)$ if for all $z, z' \in X \times P$, $f(z) + f(z') \leq f(z \vee z') + f(z \wedge z')$.


While verification of this condition may be difficult, a more intuitive 
condition, increasing differences, is often easily verified. 


$^{9}$But see page 14 of Topkis 1998 for a discussion of translations that convert
sets like the simplex into lattices.

$^{10}$Although we do not consider examples where the sets are discrete (or just
not convex), the definition of a lattice and subsequent results can be readily applied
to non-convex sets.


DEFINITION 5.17. The function $f(\cdot,\cdot) : X \times P \to \mathbb{R}^1$ has increasing
differences in $(x,p)$ if for all $p \leq p'$ and $x \leq x'$,
$f(x',p') - f(x,p') \geq f(x',p) - f(x,p)$.

A function with increasing differences exhibits a greater marginal 
effect of $x$ when $p$ is higher. In other words, increasing differences
formalizes the idea of complementarity. Since increasing differences is
easier to interpret in terms of the substance of models than is super- 
modularity, it is convenient that the following equivalence holds. 


PROPOSITION 5.1. (Topkis) The function $f(\cdot,\cdot) : X \times P \to \mathbb{R}^1$
has increasing differences in $(x,p)$ if and only if it is supermodular in
$(x, p)$.
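
For scalar $x$ and $p$ on a finite grid, both conditions can be verified exhaustively. The sketch below is a numerical illustration only; the objective `f` (with a complementary cross term) and the counterexample `g` are assumed functional forms chosen for the example, and the grids are arbitrary.

```python
from itertools import product


def f(x, p):
    # Assumed objective: the cross term x * p makes x and p complements.
    return x * p - x ** 2


def g(x, p):
    # Assumed counterexample: here x and p are substitutes.
    return -x * p


def has_increasing_differences(func, xs, ps):
    """Check Definition 5.17 for every ordered pair on the grid."""
    return all(
        func(x_hi, p_hi) - func(x_lo, p_hi) >= func(x_hi, p_lo) - func(x_lo, p_lo)
        for x_lo, x_hi in product(xs, repeat=2) if x_lo <= x_hi
        for p_lo, p_hi in product(ps, repeat=2) if p_lo <= p_hi
    )


def is_supermodular(func, xs, ps):
    """Check Definition 5.16 using componentwise joins and meets."""
    pts = list(product(xs, ps))
    return all(
        func(*z) + func(*w)
        <= func(max(z[0], w[0]), max(z[1], w[1])) + func(min(z[0], w[0]), min(z[1], w[1]))
        for z in pts for w in pts
    )


xs, ps = range(5), range(5)
print(has_increasing_differences(f, xs, ps), is_supermodular(f, xs, ps))
print(has_increasing_differences(g, xs, ps), is_supermodular(g, xs, ps))
```

On this grid the two checks agree for both functions, which is what the equivalence in Proposition 5.1 leads one to expect.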

In the case of a twice differentiable function $f(\cdot,\cdot) : X \times P \to \mathbb{R}^1$,
supermodularity coincides with the condition

$$\frac{\partial^2 f}{\partial z_1 \partial z_2} \geq 0$$

for any distinct coordinates $z_1$ and $z_2$ of $X$ or $P$ or both. Supermodularity is
preserved under several types of operations.


PROPOSITION 5.2. Suppose that $X$ is a lattice then

(a) if $f(x)$ is supermodular on $X$ and $a > 0$ then $af(x)$ is
supermodular on $X$.

(b) if $f(x)$ and $g(x)$ are supermodular on $X$ then $f(x) + g(x)$ is
supermodular on $X$.

(c) if $f_t(x)$ is supermodular on $X$ for $t = 1, 2, \ldots$ and
$f(x) = \lim_{t \to \infty} f_t(x)$ for each $x \in X$ then $f(x)$ is supermodular on $X$.

(d) if $F(\omega)$ is a distribution function on a set $\Omega$ and $g(x,\omega)$ is
supermodular on $X$ for each $\omega \in \Omega$ then $f(x) = \int_{\Omega} g(x,\omega)\,dF(\omega)$ is
supermodular on $X$.


We first present the implications of supermodularity in the simple 
case of an individual optimization problem. Supermodularity implies 
that the optimal solution has monotone comparative statics. By this
we mean that each element of $\arg\max_{x \in X} f(x,p)$ is an increasing
function of $p$. Specifically we have the following result.


PROPOSITION 5.3. (Topkis) If $f(\cdot,\cdot) : X \times P \to \mathbb{R}^1$ is supermodular
in $(x,p)$ and $g(p) = \arg\max_{x \in X} f(x, p)$ then each component of $g(p)$ is
increasing in $p$ where $g(p)$ exists.
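
A small grid search illustrates the monotone comparative static. This is a sketch under assumptions: the objective $f(x,p) = px - x^2$ and the integer grids are chosen for the example, and `argmax_set` is a hypothetical helper. Because the maximizer need not be unique, the sketch tracks both the smallest and the largest maximizer.

```python
def argmax_set(p, xs):
    """All grid points that maximize the assumed objective p * x - x**2."""
    values = {x: p * x - x ** 2 for x in xs}
    best = max(values.values())
    return [x for x in xs if values[x] == best]


xs = range(0, 11)   # choice set X
ps = range(0, 21)   # parameter set P

smallest = [min(argmax_set(p, xs)) for p in ps]
largest = [max(argmax_set(p, xs)) for p in ps]
print(smallest)
print(largest)
```

Both sequences are weakly increasing in $p$, even at odd values of $p$ where two grid points tie for the maximum.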


Of course nothing in the theorem says that an optimal solution
exists. Recall that Theorem 2.3 establishes the non-emptiness of the
maximal set for lower continuous orderings on compact sets. Lower continuity
of preferences corresponds to what is called upper semicontinuity of
objective functions.


DEFINITION 5.18. A function $f(x) : X \to \mathbb{R}^1$ is upper semicontinuous
at $x_0 \in X$ if for all $\varepsilon > f(x_0)$ there is some $\delta > 0$ such that for
all $x \in B(x_0, \delta)$ (an open ball around $x_0$ with radius $\delta$) $\varepsilon > f(x)$. A
function is upper semicontinuous on $X$ if it is upper semicontinuous at
every $x_0 \in X$.


Upper semicontinuity at $x_0$ requires that $\limsup_{x \to x_0} f(x) \leq f(x_0)$.
Using the result of Exercise 5.11 and Theorem 2.3 and restating Topkis’
result leads to the following conclusion.


PROPOSITION 5.4. If $f(\cdot,\cdot) : X \times P \to \mathbb{R}^1$ is supermodular in $(x, p)$
and upper semicontinuous then $g(p) = \arg\max_{x \in X} f(x, p)$ is non-empty
for every $p \in P$ and each component of $g(p)$ is increasing in $p$.


In our argument for existence with continuity, we used properties of
the objective functions to establish properties of the best responses and
then applied a fixed point theorem to the best response correspondence.
Analogously, Tarski’s fixed point theorem allows us to use monotonicity
of best responses to establish existence of Nash equilibria. This and
the previous proposition then lead to the conclusion that a game with
supermodular and upper semicontinuous utility functions has a Nash
equilibrium. We first present Tarski’s theorem.


PROPOSITION 5.5. (Tarski) If $f(x)$ is an increasing mapping from
a compact lattice $X$ into itself then there exists at least one fixed point
$x^*$ such that $f(x^*) = x^*$.
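
On a finite chain the fixed point can be found constructively by iterating the map from the bottom or the top of the lattice. The sketch below is an illustration, not a proof: the map `f` on $\{0, \ldots, 100\}$, with an upward jump at 50, is an assumed example, and `iterate_to_fixed_point` is a hypothetical helper.

```python
def f(x):
    # Assumed increasing map from {0, ..., 100} into itself, discontinuous at 50.
    return x // 2 + 10 if x < 50 else x // 2 + 40


def iterate_to_fixed_point(func, start):
    """Apply func repeatedly until the value stops changing."""
    x = start
    while func(x) != x:
        x = func(x)
    return x


lattice = range(101)
all_fixed_points = [x for x in lattice if f(x) == x]
from_bottom = iterate_to_fixed_point(f, 0)
from_top = iterate_to_fixed_point(f, 100)
print(all_fixed_points, from_bottom, from_top)
```

Because the map is increasing, the sequence started at the bottom rises monotonically and stops at the smallest fixed point, while the sequence started at the top falls to the largest one; the jump does not let the map avoid every crossing.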


Insert Figure 5.7 


In Figure 5.7 we demonstrate the case where $X$ is one dimensional.
As long as the function $f(x)$ is increasing, the presence of discontinuities
does not allow the function to skip every crossing of the dotted
45° line. Just as Kakutani’s theorem generalized Brouwer’s to the case
of correspondences, Zhou (1994) represents a generalization of Tarski
to the case of correspondences. We do not present this result as much
additional notation is needed, but instead present directly the result
for supermodular games. A regular supermodular game is a normal
form game $(N, \{S_i, u_i(\cdot,\ldots,\cdot)\}_{i \in N})$ in which $S_i$ is a compact lattice for
each $i \in N$ and $u_i(\cdot,\ldots,\cdot)$ is supermodular and upper semicontinuous
for each $i \in N$.


PROPOSITION 5.6. A regular supermodular game has at least one
pure strategy Nash equilibrium and the set of such equilibria is a lattice
with a smallest and biggest equilibrium profile.


Why do we care about the existence of a smallest and biggest
equilibrium? Our second interesting result for supermodular games deals
with comparative statics of these particular equilibria. To motivate
the finding we return to the case of a function $f(x)$ on $\mathbb{R}^1$.


Insert Figure 5.8 


In Figure 5.8 we depict an increasing function with four fixed points
$f(x) = x$. The dotted curve $f'(x)$ represents the result of shifting $f(x)$
up. Note that the biggest and smallest fixed points shift to the right 
(get larger) when the function is shifted up. In contrast some of the 
fixed points move the other way. The smallest and biggest fixed points 
tend to be the result of the function crossing the 45° line from above and 
behave the same way. This intuition forms the basis for the following 
comparative static result. 


PROPOSITION 5.7. In a regular supermodular game the smallest and 
biggest equilibrium profiles are increasing in p. 
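
The one-dimensional intuition behind this result can be reproduced on a finite chain. In the sketch below the map `f` and the way the parameter `p` shifts it upward are assumptions made purely for illustration; the code lists the fixed points for two parameter values and compares the extremal ones.

```python
def f(x, p):
    # Assumed increasing map on {0, ..., 100}; a higher p shifts the map up.
    base = x // 2 + 10 if x < 50 else x // 2 + 40
    return min(100, base + p)


def fixed_points(p):
    """All x in {0, ..., 100} with f(x, p) == x."""
    return [x for x in range(101) if f(x, p) == x]


low_p, high_p = fixed_points(0), fixed_points(5)
print(min(low_p), max(low_p))
print(min(high_p), max(high_p))
```

In this example the smallest and the largest fixed points both move up when the map shifts up. The sketch says nothing about interior fixed points, which the discussion of Figure 5.8 notes may move the other way.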


As Figure 5.8 suggests other equilibria may behave differently. Echenique 
(2002) shows that in games like this with complementarities any selec- 
tion of equilibria that does not also exhibit the monotone comparative 
static of the biggest and smallest equilibria will be unstable under a 
large range of adaptive dynamics. In other words there are clear rea- 
sons to select equilibria which exhibit the comparative static described 
in the proposition. 

Returning to the discussion of Kahn and Kenney’s competitiveness
hypothesis, suppose that candidate 1 wishes to maximize
$\pi(s_1, s_2, c) - \beta_1 s_1$ and candidate 2 wishes to maximize
$1 - \pi(s_1, s_2, c) - \beta_2 s_2$ where $\pi(s_1, s_2, c)$ depends on an exogenous
level of competitiveness $c \in \mathbb{R}^1$.
The fact that campaign activity influences competitiveness and that
voters pay closer attention in more competitive races suggests that the
incremental effect of $s_1$ on $\pi(s_1, s_2, c)$ should be higher when $s_2$ and/or
$c$ are higher. A symmetric argument follows for the incremental effect
of $s_2$. Accordingly the assumption that payoffs are supermodular is
consistent with (if not implied by) Kahn and Kenney’s explanation.
Assuming compactness of the choice sets and upper semicontinuity of
$\pi(s_1, s_2, c)$ in $s_1$ and $-\pi(s_1, s_2, c)$ in $s_2$ allows us to conclude (from
Propositions 5.6 and 5.7) that equilibria exist and in the biggest and
smallest equilibria $s_i$ is increasing in $c$. This may seem like a pretty
strong conclusion to reach without having to specify very much about
the function $\pi(\cdot,\cdot,\cdot)$. The power of monotone comparative statics is
that they tend to isolate the minimal structure that is needed for a
particular comparative static result.

For a thorough review of results for supermodular games and monotone
comparative statics see Topkis (1998). A focused summary of results
and applications to political science appears in Ashworth and Bueno de
Mesquita (2004).


12. Refining Nash Equilibria 


Recall the majority rule voting game of section where we showed
that any profile in which no single voter is pivotal is a Nash equilibrium.
We justified ignoring these equilibria on the basis that they involved
playing weakly dominated strategies. However, the elimination of
weakly dominated strategies is not the only way that we can justify
“refining” the set of Nash equilibria in this game. Suppose instead of
assuming that every agent is capable of playing her best response with
probability one, we assume that there is some small probability that each
player will “tremble” and play another strategy. To keep things very
simple, let’s look at a three player version of the game where 2 voters
prefer D and one prefers R. Formally, we will assume that each player
must play each pure strategy with at least a small probability $\varepsilon$ which
must be less than $\frac{1}{2}$.$^{11}$ This captures the idea that mistakes ensure
that all strategies are played with at least a minimal probability.

Clearly it is a best response for each agent to attempt to maximize 
the probability of their preferred candidate winning which is the same 
as minimizing the probability that the least desired candidate wins. 
So we need to compute the probability that each candidate wins under 
various combinations of strategies. The probabilities of an R victory
are:

$$\begin{aligned}
\Pr(R \mid 3 \text{ attempted votes for } R) &= (1-\varepsilon)^3 + 3(1-\varepsilon)^2\varepsilon \\
\Pr(R \mid 2 \text{ attempted votes for } R) &= (1-\varepsilon)^3 + (1-\varepsilon)^2\varepsilon + 2(1-\varepsilon)\varepsilon^2 \\
\Pr(R \mid 1 \text{ attempted votes for } R) &= \varepsilon^3 + \varepsilon^2(1-\varepsilon) + 2\varepsilon(1-\varepsilon)^2 \\
\Pr(R \mid 0 \text{ attempted votes for } R) &= \varepsilon^3 + 3\varepsilon^2(1-\varepsilon)
\end{aligned}$$

The reader should verify that since $\varepsilon < \frac{1}{2}$, the probability that a
Republican wins is strictly increasing in the number of intended votes.

First, we consider the “bad” equilibrium where R wins unanimously.
Does this outcome survive in the presence of trembles, i.e.,
does each voter vote R with the maximal probability $1 - \varepsilon$? Clearly,
the R preferring voter will maximize the probability of an R victory by
voting R with probability $1 - \varepsilon$. However, consider the choice of a D
voter. She has the choice of conforming to the equilibrium by intending
to vote R, in which case R wins with probability
$(1-\varepsilon)^3 + 3(1-\varepsilon)^2\varepsilon$, or defecting to an intended vote of D, in which
case R wins with probability
$(1-\varepsilon)^3 + (1-\varepsilon)^2\varepsilon + 2(1-\varepsilon)\varepsilon^2$. This defection reduces the
probability that R wins by $2(1-\varepsilon)^2\varepsilon - 2(1-\varepsilon)\varepsilon^2 > 0$. Thus, the D voter
will prefer the defection. It is easy to show that the equilibrium
in which all voters vote D also does not survive trembles.

$^{11}$The idea that voters might not vote for the candidate that they intended has
a renewed substantive importance after the 2000 presidential election.
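
The victory probabilities can be verified by enumerating all realized vote profiles. The sketch below assumes, as in the text, three voters who each cast their intended vote with probability $1-\varepsilon$ and the opposite vote with probability $\varepsilon$; the function name `prob_r_wins` and the value $\varepsilon = 0.1$ are illustrative choices.

```python
from itertools import product


def prob_r_wins(intended_r, eps, voters=3):
    """Probability that R gets a majority when `intended_r` voters intend R."""
    # Each voter's chance of actually voting R, given trembles of size eps.
    p_vote_r = [1 - eps] * intended_r + [eps] * (voters - intended_r)
    total = 0.0
    for votes in product([0, 1], repeat=voters):  # 1 = a realized vote for R
        if 2 * sum(votes) > voters:               # R holds a strict majority
            prob = 1.0
            for v, p in zip(votes, p_vote_r):
                prob *= p if v == 1 else 1 - p
            total += prob
    return total


eps = 0.1
probs = [prob_r_wins(m, eps) for m in range(4)]
gain_from_defecting = probs[3] - probs[2]
print(probs, gain_from_defecting)
```

With $\varepsilon = 0.1$ the probabilities rise with the number of intended R votes, and the drop from three to two intended votes equals $2(1-\varepsilon)^2\varepsilon - 2(1-\varepsilon)\varepsilon^2$, the amount by which a D voter's defection lowers R's chances.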

This concept of refining the set of equilibria by focusing on those 
that are robust to small mistakes by the agents was first developed 
by Selten. He named such equilibria perfect. A formal definition of 
perfect equilibria follows: 


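DEFINITION 5.19. An “$\varepsilon$-constrained” equilibrium is a totally mixed
strategy profile $\sigma^{\varepsilon}$ such that for each player $i$, $\sigma_i^{\varepsilon}$ solves
$\max_{\sigma_i} u_i(\sigma_i, \sigma_{-i}^{\varepsilon})$ subject to $\sigma_i(s_i) \geq \varepsilon$. A perfect equilibrium
is the limit of $\varepsilon$-constrained equilibria as $\varepsilon$ goes to 0.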


It is easy to see how the unanimous voting equilibria fail to meet this
definition since in the $\varepsilon$-constrained equilibria all players vote for their
least preferred candidate with probability $\varepsilon$. Thus, the limits of these
equilibria are strategy profiles which place zero probability on voting
for the lesser candidate. Indeed, the only perfect equilibrium is the
one where all voters vote for their favorite candidate.

Perfect equilibria have some desirable properties. First, all perfect
equilibria are Nash equilibria of the game without the $\varepsilon$ constraints.
Thus, the set of perfect equilibria is a subset of the set of
Nash equilibria. Secondly, it can be shown using arguments similar to
those for the existence of Nash equilibria that all finite normal form
games have at least one perfect equilibrium.$^{12}$


13. Application: Private Provision of Public Goods 


Since the publication of Mancur Olson’s Logic of Collective Action, 
a central question in political science has been the conditions under 
which individually rational agents would be willing to incur personal
costs to contribute to the public good. In this section, we present a 
game theoretic model of such contributions. This model is based on 
the work of Palfrey and Rosenthal (1984). 

Assume that there are $n$ agents who must decide whether to provide
a public good. Provision requires the contribution of a single
individual and produces a utility of 1 unit for each individual. However, any
contributor pays a cost $c < 1$. Thus, the strategy set for each agent is
{contribute, don’t contribute}. To simplify notation, we define
“contribute” as $s_i = 1$ and “don’t contribute” as $s_i = 0$. Since we will also
consider mixed strategy equilibria, let $\sigma_i$ be the probability that agent
$i$ contributes. Given this setup, the payoff for each agent is $1 - c$ if
she contributes, 1 if she doesn’t contribute but some other agent does,
and 0 otherwise.

$^{12}$The proof follows from the fact that the $\varepsilon$-constrained mixed strategy space is
compact, convex, and non-empty.

First, consider the set of pure strategy equilibria. It is easy to
check that for each agent $i$ there is an equilibrium where $s_i = 1$ and
$s_{-i} = 0$. Agent $i$ receives $1 - c$ while all other agents receive a utility of
1. Clearly, agent $i$ will not defect since failing to contribute will lower
her payoff to 0. Similarly, no other agent will defect to contributing
since it will lower his payoff from 1 to $1 - c$. Now we can check to see
that no other combination of strategies is a pure strategy Nash
equilibrium. First, consider $s_i = 0$ for all $i$. In this case, any agent would
do better by contributing so that this profile cannot be an equilibrium.
Next consider a strategy combination where more than one agent
contributed to the public good. Clearly, any contributor would increase
her utility by withholding the contribution since the good would still be
provided by the contribution from another agent.

It is important to note that the equilibria to this game are quite
different from the decision theoretic predictions of Olson. In fact, all
of the equilibria to this game are Pareto efficient since the public
good is provided by just enough contributions. However, there are
many reasons to think that these equilibria are not a very valid
description of how this game would actually be played. Most importantly,
since there are so many Nash equilibria, how would the agents ever
coordinate on one of them? Secondly, each of the pure strategy Nash
equilibria involves ex ante identical agents playing different strategies.
An equilibrium where identical agents played identical strategies would
seem more plausible. For these reasons, a more plausible equilibrium
is a mixed strategy equilibrium where every agent contributes with a
probability determined in equilibrium. Since there may be many such
equilibria, we will only consider the symmetric mixed strategy
equilibrium where $\sigma_i = \sigma$ for all $i$. This restriction is consistent with our
criticism of the asymmetry inherent in the pure strategy Nash
equilibria.

Recall that for a mixed strategy to be a best response for agent
$i$, she must be indifferent among the pure strategies that she mixes
over. Thus, if agent $i$ is willing to play $\sigma_i$ against $n - 1$ players
contributing with probability $\sigma$, it must be true that
$u_i(\text{contribute}, \sigma) = u_i(\text{not contribute}, \sigma)$ or

$$1 - c = 1 - (1 - \sigma)^{n-1}$$

Using this condition, we can solve for the equilibrium value:
$\sigma = 1 - c^{\frac{1}{n-1}}$. Note that in some ways this equilibrium is more
consistent with the predictions of Olson. As $n$ gets large, $\sigma$ goes to 0
while if $c$ goes to zero $\sigma$ goes to 1.
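
The closed form is easy to check against the indifference condition. The following sketch uses hypothetical function names and arbitrary parameter values; it computes $\sigma$ for several group sizes and confirms that contributing and abstaining yield the same expected payoff.

```python
def symmetric_mix(n, c):
    """Equilibrium contribution probability when one contribution suffices."""
    return 1 - c ** (1 / (n - 1))


def payoff_contribute(c):
    # A contributor guarantees provision and pays the cost c.
    return 1 - c


def payoff_abstain(n, sigma):
    # The good is provided unless all n - 1 other agents abstain.
    return 1 - (1 - sigma) ** (n - 1)


c = 0.25
sigmas = {n: symmetric_mix(n, c) for n in (3, 5, 10, 50)}
for n, sigma in sigmas.items():
    print(n, round(sigma, 4), round(payoff_abstain(n, sigma), 4), payoff_contribute(c))
```

For a fixed cost the contribution probability shrinks as the group grows, which is the sense in which the mixed equilibrium echoes Olson's prediction.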


13.1. Multiple Contributions. Now we will consider an
extension of this model where the public good is provided only if $k$ out of
the $n$ agents contribute. It is easy to see from the logic presented in the
previous section that there are many pure strategy equilibria where exactly
$k$ agents make contributions. Clearly, in such an equilibrium, any agent not
contributing has no incentive to defect and make a contribution as it
will cost $c$ and not change the probability of obtaining the public good.
Conversely, any contributor who defected would cause the good to not
be provided. Since saving the contribution cost is less valuable than
losing the public good, such a defection will not occur. Therefore,
there is an equilibrium corresponding to contributions by every
possible combination of $k$ agents. From a basic result in combinatorics, we
know that there are exactly

$$\binom{n}{k} = \frac{n!}{k!(n-k)!}$$

distinct Nash equilibria of this type, where $n! = 1 \cdot 2 \cdot 3 \cdots n$. The
notation $\binom{n}{k}$, known as the binomial coefficient, represents the number
of combinations of $k$ elements drawn from $n$ objects.
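
For small games the count can be confirmed by enumerating every pure strategy profile and testing for profitable unilateral deviations. The sketch is an illustration with assumed values ($n = 4$, $k = 2$, $c = 0.3$) and hypothetical helper names; payoffs follow the text, so a contributor pays $c$ whether or not the good is provided.

```python
from itertools import product
from math import comb


def payoff(i, profile, k, c):
    """Payoff to agent i: 1 if at least k contribute, minus c if i contributes."""
    provided = sum(profile) >= k
    return (1.0 if provided else 0.0) - (c if profile[i] == 1 else 0.0)


def is_pure_nash(profile, k, c):
    """True if no agent gains by unilaterally switching her action."""
    for i in range(len(profile)):
        deviation = list(profile)
        deviation[i] = 1 - profile[i]
        if payoff(i, deviation, k, c) > payoff(i, profile, k, c):
            return False
    return True


n, k, c = 4, 2, 0.3
equilibria = [p for p in product([0, 1], repeat=n) if is_pure_nash(p, k, c)]
with_provision = [p for p in equilibria if sum(p) >= k]
print(len(with_provision), comb(n, k))
```

The equilibria in which the good is provided are exactly the profiles with $k$ contributors, and their number matches the binomial coefficient. For $k \geq 2$ the enumeration also returns the profile in which nobody contributes, since a lone contribution cannot reach the threshold; that profile is not part of the count above.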

These pure strategy equilibria are perhaps even less plausible than
the pure strategy equilibria of the one contribution game. So again we
will compute the symmetric mixed strategy equilibria for this game.

Let $x_{-i}$ represent a realization of $\sigma_{-i}$, so that it is the number of
contributions made by agents other than $i$. The payoff to $i$ from
contributing is

$$\Pr(x_{-i} < k-1) \cdot 0 + \Pr(x_{-i} \geq k-1) \cdot 1 - c$$

while the payoff from abstaining is

$$\Pr(x_{-i} < k) \cdot 0 + \Pr(x_{-i} \geq k) \cdot 1$$

As before, playing mixed strategies requires that the agents be
indifferent among the pure strategies in the mixture. Equating these payoffs
and doing a bit of algebra, the necessary condition for mixing is

$$\Pr(x_{-i} = k-1) = c$$

This condition has a nice intuitive interpretation. Clearly, contributing
only has a positive benefit if exactly $k - 1$ other agents contribute.
Thus, the payoff from contributing to the public good is discounted
by the probability that a contribution is pivotal. Because agents will
be mixing, this expected benefit has to be equated to the contribution
cost $c$.

Since we are assuming that all agents are independently playing the
same mixed strategy $\sigma$, we can compute the exact value of $\Pr(x_{-i} = k-1)$
as it equals the probability of obtaining exactly $k - 1$ successes in $n - 1$
trials with a success probability of $\sigma$. Thus, a standard result in
probability theory implies that

$$\Pr(x_{-i} = k-1) = \binom{n-1}{k-1} \sigma^{k-1} (1-\sigma)^{n-k}$$

Therefore, computing the symmetric mixed strategy equilibrium
involves finding the set of $\sigma$ that solve:

$$\binom{n-1}{k-1} \sigma^{k-1} (1-\sigma)^{n-k} = c$$

or

$$\sigma^{k-1} (1-\sigma)^{n-k} = \frac{(k-1)!\,(n-k)!}{(n-1)!}\, c$$

Before characterizing exactly what the set of solutions looks like, it
is worthwhile to look at a couple of examples. First, suppose that
$n = 5$ and $k = 3$. Then the equation reduces to $\sigma^2 (1-\sigma)^2 = \frac{1}{6}c$. The
solid lines of Figure 5.9 plot the left and right sides of this equation.
Note that as long as $c$ is sufficiently low, there are two solutions to the
equation which represent distinct mixed strategy equilibria,
$\sigma_l^* < \frac{1}{2} < \sigma_h^*$. It is easy to see how the equilibrium mixtures change
as a function of $c$. The effect of increasing $c$ is to raise $\sigma_l^*$ and
lower $\sigma_h^*$.



Insert Figure 5.9 


Figure 5.9 also plots the conditions for $k = 4$ and $k = 5$ which are
given by $\sigma^3 (1-\sigma) = \frac{1}{4}c$ and $\sigma^4 = c$ respectively. The case of $k = 4$
is similar to $k = 3$ in that it also has two mixed strategy equilibria.
However, $\sigma^3(1-\sigma) > \sigma^2(1-\sigma)^2$ if $\sigma > \frac{1}{2}$. This effect plus the fact
that $\frac{1}{4}c > \frac{1}{6}c$ implies that $\sigma_h^*$ increases in $k$. Since
$\sigma^3(1-\sigma) < \sigma^2(1-\sigma)^2$ if $\sigma < \frac{1}{2}$, $\sigma_l^*$ also increases. In the case of
$k = 5$, the fact that $\sigma^4$ is an increasing function for $0 < \sigma < 1$ implies
that there can only be one mixed equilibrium. It has a higher contribution
probability than $\sigma_l^*$ when $k = 4$, and it increases in $c$.

Many of the implications of the examples generalize. First regard- 
less of n and k, there can be at most two mixed strategy equilibria. To 
see this, let c(o) be the level of costs that supports o as the equilibrium 
mixing strategy or 


c(o) = ( ‘ae Jon d=a* 


Differentiating with respect to 0, we find that 


dy=( RIE) DO-2)- (Holo -oy™ 


It is easy to see that if $1 < k < n$ the function $c(\sigma)$ is single peaked since $c'(\sigma) = 0$ if $\sigma = \frac{k-1}{n-1}$. Thus, if $1 < k < n$ and $c < c\left(\frac{k-1}{n-1}\right) \equiv c_{\max}$, there will be two mixed strategy equilibria, but none if $c > c_{\max}$.¹³ If $k = n$ or $k = 1$, there will be one mixed strategy equilibrium so long as $c < 1$.

We can also use this result to say something about what happens as $n$ gets very large. Since $c(0) = 0$, we know that $c_{\max}$ goes to zero as $n$ gets large. Thus, in the limit there are no mixed strategy equilibria.

For $\sigma_h^* > \frac{k-1}{n-1}$, contribution probabilities fall in $c$ and $n$ and increase in $k$. For $\sigma_l^* < \frac{k-1}{n-1}$, the contribution probabilities increase in $c$ and $n$. For sufficiently small $\sigma_l^*$, contributions are falling in $k$, while they increase in $k$ when $\sigma_l^*$ is closer to $\frac{k-1}{n-1}$.¹⁴
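The two symmetric mixed strategy equilibria can also be located numerically. The following Python sketch is illustrative only: the function names are hypothetical, the cost $c = 0.24$ is an arbitrary example value, and the code assumes $1 < k < n$ so that $c(\sigma)$ rises up to $\frac{k-1}{n-1}$ and falls afterwards. It bisects on each side of the peak to find the two roots of $c(\sigma) = c$.

```python
from math import comb


def cost_supporting(sigma, n, k):
    """c(sigma): the cost at which mixing with probability sigma is an equilibrium."""
    return comb(n - 1, k - 1) * sigma ** (k - 1) * (1 - sigma) ** (n - k)


def bisect(f, lo, hi, tol=1e-12):
    """Root of f on [lo, hi], assuming f(lo) and f(hi) have opposite signs."""
    f_lo = f(lo)
    while hi - lo > tol:
        mid = (lo + hi) / 2
        f_mid = f(mid)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return (lo + hi) / 2


def mixed_equilibria(n, k, c):
    """Return (sigma_l, sigma_h) solving c(sigma) = c, or None if c > c_max.

    Assumes 1 < k < n, so c(sigma) is single peaked at (k - 1) / (n - 1).
    """
    peak = (k - 1) / (n - 1)
    c_max = cost_supporting(peak, n, k)
    if c > c_max:
        return None
    if c == c_max:
        return peak, peak  # knife-edge case: a single mixed equilibrium
    gap = lambda s: cost_supporting(s, n, k) - c
    return bisect(gap, 0.0, peak), bisect(gap, peak, 1.0)


# The n = 5, k = 3 example, with an illustrative cost below c_max = 0.375.
print(mixed_equilibria(5, 3, 0.24))
# A cost above c_max leaves no mixed strategy equilibrium.
print(mixed_equilibria(5, 3, 0.5))
```

For $n = 5$ and $k = 3$ the two roots are symmetric around $\frac{1}{2}$, which matches the shape of the left side $\sigma^2(1-\sigma)^2$ in the example above.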


14. Exercises 


EXERCISE 5.1. In the Hotelling model, show that there is no equilibrium in pure strategies if there are three parties for any specification of the parties' objectives. What is the Nash equilibrium with four parties if parties maximize vote share?


EXERCISE 5.2. Show that the prisoner’s dilemma has no mixed 
strategy equilibrium. 


¹³ Of course in the unlikely event that $c = c_{\max}$, there will be a single symmetric mixed strategy equilibrium.

¹⁴ Some derivations useful in proving these claims are available in the appendix to Palfrey and Rosenthal (1988).




EXERCISE 5.3. Find the mixed strategy equilibrium of the following
game:


EXERCISE 5.4. Verify that the two definitions of Nash Equilibrium are equivalent. Hint: show that if a strategy profile satisfies the first then it must satisfy the second, and then that if it satisfies the second it must satisfy the first.


EXERCISE 5.5. Prove Theorem 5.6. 
EXERCISE 5.6. * Prove Theorem 5.6. 


EXERCISE 5.7. *Show that an upper hemi-continuous correspon- 
dence that is single valued is a continuous function. 


EXERCISE 5.8. Characterize the pure strategy Nash equilibria to the International Externality game when $k_1 = k_2$.


EXERCISE 5.9. Verify that if $f(\cdot,\cdot)$ has increasing differences in $(x,p)$ then for all $p < p'$ and $x < x'$, $f(x,p') - f(x,p) \le f(x',p') - f(x',p)$.


EXERCISE 5.10. *Prove parts a-c of Proposition 5.2. 


EXERCISE 5.11. *Assume that $X$ is a compact subset of $\mathbb{R}^n$ and $R$ is a lower continuous partial order on $X$. Show that if $u(x)$ is a utility function that represents $R$ on $X$ then $u(\cdot)$ is upper semi-continuous on $X$. Now show that if $u(x)$ is upper semi-continuous on $X$ then any preference relation that it represents is lower continuous.


EXERCISE 5.12. Consider the Palfrey-Rosenthal contribution game. Construct an asymmetric Nash equilibrium where $l$ agents contribute ($\sigma_i = 1$), $m$ agents do not contribute ($\sigma_i = 0$), and $n - m - l$ agents choose a mixed strategy $\sigma_i = q \in (0,1)$. Show that if $l > 0$ or $m > 0$, $q^*$ is unique. Is this a stable equilibrium?


EXERCISE 5.13. Consider an extension of the Palfrey-Rosenthal 
model where contributions are refunded if the public good is not pro- 
vided (i.e. fewer than k contributions are made). Characterize the 
pure strategy and mixed strategy equilibria of this game. 


CHAPTER 6 


Bayesian Games in the Normal Form 


In the normal form games considered above there was no uncertainty. The agents know their own payoffs and those of their opponents. More precisely, the entire structure of the game is what is called common knowledge: each player knows all the details of the game, each player knows that each player knows the details of the game, each player knows that each player knows that each player knows, and so on ad infinitum. However, in many settings this assumption is inappropriate and we might suspect that interesting strategic incentives are created by uncertainty. Return to the “Terrorist Hunt” game:


Table 6.1: The Terrorist Hunt

FBI\CIA   | Kingpin | Operative
Kingpin   | 2,2     | 0,1
Operative | 1,0     | 1,1


We might think that the CIA is not sure if the FBI prefers arresting 
Operatives to not arresting anyone. The loss of an operative may not 
lower terrorism risks but involve dramatic administrative headaches. 
So the CIA may think that it is possible that the game looks like 


Table 6.2: Modified Hunt 1

FBI\CIA   | Kingpin | Operative
Kingpin   | 2,2     | 0,1
Operative |         |


But suppose that the CIA fears that the FBI may have yet a different preference ordering, preferring to arrest Operatives more than Kingpins, because Operatives usually fold under pressure and provide lots of information, whereas Kingpins remain silent. In this case the game might look like


Table 6.3: Modified Hunt 2

FBI\CIA   | Kingpin | Operative
Kingpin   | 1,2     | 0,1
Operative | 2,0     | 2,1


In this setting the CIA’s calculation of which strategy to play may 
differ. We saw that in the original game the pure strategy Nash equi- 
libria are (Kingpin, Kingpin) and (Operative, Operative). But if the 
CIA thinks that the FBI is possibly of one of these alternative types 
then it is less clear which strategy the CIA should play. In this case we 
cannot apply our notion of Nash equilibria or dominance to the game. 

In this section we develop tools to analyze richer models involving agents that do not know the payoffs of the other players. This feature is termed incomplete information. The standard practice (originated by Harsanyi 1967-68) is to convert such a game into one where a fictional player (Nature) moves first, drawing the utility functions of the agents from a probability distribution that is known to the players. Following this draw, agents simultaneously select their actions. The converted game is one of imperfect information. So in the modified “Terrorist Hunt” we might think that Nature selects the preferences (or type) of the FBI by tossing a fair three-sided die, making each type equally likely. The FBI knows its type and chooses an action. The CIA does not know the FBI's type and chooses its action. In this case, specifying strategies for the CIA is somewhat more complicated. In assessing the desirability of a strategy, the CIA needs to form a conjecture about the strategy that each of the three possible FBI types will use. Given such a conjecture, the CIA can compare the expected utility of choosing Kingpin or Operative. Conversely, given a conjecture about the CIA's strategy, each of the possible types of FBI should choose a strategy that maximizes its utility. In essence we have translated a problem in which the CIA does not know the preferences of the FBI into a new game in which the CIA is playing one of three possible FBI players (drawn from a known distribution) and each player (the three FBI types plus the CIA) chooses a strategy.

Aside from the possibility that agents may not know each other's preferences, it is possible that agents do not know their own preferences. Returning to the original “Terrorist Hunt” problem, a more realistic model involves a probability of catching a terrorist conditional on the strategies of the players. We might think that the original matrix is justified by the following matrix.


Table 6.4: Terrorist Hunt with Uncertainty

FBI\CIA   | Kingpin                                 | Operative
Kingpin   | 10 × 1/5 + 0 × 4/5, 10 × 1/5 + 0 × 4/5  | 0, 6 × 1/6 + 0 × 5/6
Operative | 6 × 1/6 + 0 × 5/6, 0                    | 6 × 1/6 + 0 × 5/6, 6 × 1/6 + 0 × 5/6


Here 10 is the utility payoff to catching a Kingpin and $\frac{1}{5}$ is the probability of catching a Kingpin if both agencies cooperate on Kingpin searching. Alternatively, 0 is the payoff to failed Kingpin searching, which occurs with probability $\frac{4}{5}$ when the agencies cooperate on Kingpin searching. While the overall payoffs are the same in this matrix and the earlier one, this representation explicitly captures the fact that payoffs may depend on both strategies and some additional randomness in the world.

A comment is in order about the role of common knowledge in games of incomplete information. As we pointed out at the beginning of the chapter, we assume in games of complete information that all elements of the game (players, strategies, and payoffs) are known to all players and all players know this. In games of incomplete information, we still maintain the common knowledge assumption. In particular, we assume that all players know the probability distribution that Nature uses in selecting player types, that all players know that all players know this, and so on.


1. Formal Definitions 


We now modify our basic normal form structure $\Gamma$ to account for both imperfect information about player types and payoffs that depend on additional randomness.¹ We begin by adding to $\Gamma$ player types, a known lottery over these types, and an additional random state variable.


(1) Types: We assume that for each player $i \in N$ there is a finite set $\Theta_i$ of possible types. In the first example of this section, this set is a singleton for the CIA and it includes three elements for the FBI. By $\theta_{-i}$ and $\Theta_{-i}$ we denote a profile (and the set of such profiles) of types for all players other than $i$.

(2) Random state: In addition there is a random variable $\omega \in \Omega$ (where $\Omega$ is assumed to be a finite set). In the second example of this section $\omega$ could be thought of as the realization of searching given different strategies. So $\omega$ specifies whether a Kingpin or Operative will be found given every strategy in $S$.

(3) Nature's randomization: At the beginning of the game Nature selects the vector of player types $\theta = (\theta_1, \ldots, \theta_n) \in \Theta = \prod_{i \in N} \Theta_i$ and $\omega \in \Omega$ from a joint lottery assigning each pair $(\theta, \omega)$ probability $p(\theta, \omega)$. By $p(\theta_{-i}, \omega \mid \theta_i)$ we denote the conditional probability of $\theta_{-i}, \omega$ given $\theta_i$.

(4) Strategies: Each player selects an action $s_i$ in the set $S_i$.

Expected utilities: For each possible strategy profile $s$, type $\theta_i$ and state $\omega$, agent $i$ has utility $u_i(s_i, s_{-i}, \theta_i, \omega)$. Given her type $\theta_i$, agent $i$'s conditional expected utility from strategy profile $s$ is

$$EU_i(s; \theta_i) = \sum_{\theta_{-i} \in \Theta_{-i}} \sum_{\omega \in \Omega} p(\theta_{-i}, \omega \mid \theta_i)\, u_i(s, \theta_i, \omega).$$

Accordingly a normal form Bayesian game is defined by the collection $(N, \Omega, \{S_i, \Theta_i, u_i(\cdot, \ldots, \cdot)\}_{i \in N}, p(\cdot, \cdot))$. We sometimes use the shorthand $(N, \Omega, S, \Theta, u, p)$. Just as normal form games may be defined with infinite strategy spaces, Bayesian games may be defined with infinite type and action spaces. We provide several such examples below.

¹ Some readers may want to review probability theory in the mathematical appendix before proceeding.

In a normal form game a strategy profile is just a list $s \in S$. In a Bayesian game we need to record the strategy that each possible type of player will use. Accordingly in a Bayesian game a strategy for player $i$ is a function $\phi_i(\theta_i): \Theta_i \to S_i$ that selects a strategy $s_i \in S_i$ for each possible type $\theta_i \in \Theta_i$. In the version of “Terrorist Hunt” in which the CIA is not sure of the FBI's preferences, if we consider the FBI types as $\Theta_{FBI} = \{\text{standard}, \text{pro-Kingpin}, \text{pro-Operative}\}$, each occurring with equal probability, then an example of a strategy for the FBI is $\phi_{FBI}(\text{standard}) = \text{Kingpin}$, $\phi_{FBI}(\text{pro-Kingpin}) = \text{Kingpin}$, $\phi_{FBI}(\text{pro-Operative}) = \text{Operative}$.

In a Bayesian normal form game we can extend the idea of Nash 
equilibria to the concept of Bayesian Nash equilibria. Since discussion 
of best responses when a strategy itself is a function is a little com- 
plicated, the easiest way to define a Bayesian Nash equilibrium is to 
extend our second definition of Nash equilibrium. 


DEFINITION 6.1. Given a normal form Bayesian game $(N, \Omega, S, \Theta, u, p)$, a Bayesian Nash equilibrium is a profile of strategies $(\phi_1(\cdot), \ldots, \phi_n(\cdot))$ such that for every $i \in N$ and for each $\theta_i \in \Theta_i$,

(6.1)  $EU_i(\phi_i(\theta_i), \phi_{-i}(\cdot); \theta_i) \ge EU_i(s_i, \phi_{-i}(\cdot); \theta_i)$ for every $s_i \in S_i$.


Thus, in a Bayesian Nash equilibrium, every type of each player chooses a strategy that maximizes its expected utility given the strategies of all other player types and the probability distribution of those types.

Returning to the multiple-type FBI game, we can solve for a Bayesian Nash equilibrium. Suppose that the FBI strategy is as specified above, $\phi_{FBI}(\text{standard}) = \text{Kingpin}$, $\phi_{FBI}(\text{pro-Kingpin}) = \text{Kingpin}$, $\phi_{FBI}(\text{pro-Operative}) = \text{Operative}$. In this game the CIA has only one possible type and there is no uncertainty other than FBI types so we can suppress $\omega$. Given this strategy, we have

$$EU_{CIA}(\text{Kingpin}, \phi_{FBI}(\cdot)) = \tfrac{2}{3} \cdot 2 + \tfrac{1}{3} \cdot 0 = \tfrac{4}{3}$$

$$EU_{CIA}(\text{Operative}, \phi_{FBI}(\cdot)) = \tfrac{2}{3} \cdot 1 + \tfrac{1}{3} \cdot 1 = 1.$$

Thus, the CIA's best response to $\phi_{FBI}(\cdot)$ is to select Kingpin. Given this we must verify whether any of the potential FBI types wish to deviate. For the standard type FBI we know that matching the CIA is a best response, so $\phi_{FBI}(\text{standard}) = \text{Kingpin}$ is a best response. For the pro-Kingpin type, selecting Kingpin when the CIA selects Kingpin results in the highest possible payoff (2) and thus no desirable deviation exists. Finally, for the pro-Operative FBI, selecting Operative when the CIA selects Kingpin results in utility of 2 while a deviation to Kingpin results in utility 1, thus no desirable deviation exists. This means that we have characterized a Bayesian Nash equilibrium to the game.
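The verification above can be mechanized by checking every type's best response directly. The following Python sketch is illustrative only. The payoff dictionaries follow Tables 6.1 and 6.3 as read here (with (Kingpin, Kingpin) paying 2 to each agency in the standard game), and the Operative row for the pro-Kingpin type is an assumption: this FBI type is taken to get 0 from arresting an Operative. The names `CIA_PAYOFF`, `FBI_PAYOFF`, and `is_bayesian_nash` are hypothetical.

```python
from fractions import Fraction

ACTIONS = ("Kingpin", "Operative")
TYPES = ("standard", "pro-Kingpin", "pro-Operative")
PRIOR = {t: Fraction(1, 3) for t in TYPES}  # fair three-sided die

# CIA payoffs depend only on actions, keyed by (fbi_action, cia_action).
CIA_PAYOFF = {
    ("Kingpin", "Kingpin"): 2, ("Kingpin", "Operative"): 1,
    ("Operative", "Kingpin"): 0, ("Operative", "Operative"): 1,
}

# FBI payoffs by type, keyed by (fbi_action, cia_action).
FBI_PAYOFF = {
    "standard": {("Kingpin", "Kingpin"): 2, ("Kingpin", "Operative"): 0,
                 ("Operative", "Kingpin"): 1, ("Operative", "Operative"): 1},
    # ASSUMPTION: arresting an Operative is worth 0 to the pro-Kingpin type.
    "pro-Kingpin": {("Kingpin", "Kingpin"): 2, ("Kingpin", "Operative"): 0,
                    ("Operative", "Kingpin"): 0, ("Operative", "Operative"): 0},
    "pro-Operative": {("Kingpin", "Kingpin"): 1, ("Kingpin", "Operative"): 0,
                      ("Operative", "Kingpin"): 2, ("Operative", "Operative"): 2},
}


def cia_expected_utility(cia_action, fbi_strategy):
    """Average the CIA payoff over FBI types, given each type's action."""
    return sum(PRIOR[t] * CIA_PAYOFF[(fbi_strategy[t], cia_action)] for t in TYPES)


def is_bayesian_nash(fbi_strategy, cia_action):
    """True if the CIA and every FBI type are playing best responses."""
    cia_ok = all(
        cia_expected_utility(cia_action, fbi_strategy)
        >= cia_expected_utility(a, fbi_strategy)
        for a in ACTIONS
    )
    fbi_ok = all(
        FBI_PAYOFF[t][(fbi_strategy[t], cia_action)] >= FBI_PAYOFF[t][(a, cia_action)]
        for t in TYPES for a in ACTIONS
    )
    return cia_ok and fbi_ok


phi_fbi = {"standard": "Kingpin", "pro-Kingpin": "Kingpin",
           "pro-Operative": "Operative"}
print(cia_expected_utility("Kingpin", phi_fbi))
print(cia_expected_utility("Operative", phi_fbi))
print(is_bayesian_nash(phi_fbi, "Kingpin"))
```

Because each FBI type faces a CIA with a single type, the FBI check needs no expectation, while the CIA check averages over the three FBI types.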


2. Application: Trade restrictions 


We consider a simple setting involving two nations that are contemplating restrictive trade policies. Let $N = \{1,2\}$ and suppose that each country has two possible types $\Theta_i = \{u, b\}$. A type $u$ country wishes to limit its imports from the other country unilaterally, while a type $b$ country wishes to pursue a bilateral policy of limiting trade only if the other country does so. We assume the country types are independently drawn with type $u$ occurring with probability $p \in (0,1)$. The strategy space for each country is $S = \{l, f\}$ where $l$ denotes enacting an import limit and $f$ denotes a free-trade policy. Country $i$ has the following payoffs

$$u_i(s_i, s_{-i}; \theta_i) = \begin{cases}
3 & \text{if } s_i = l,\ s_{-i} = f \text{ and } \theta_i = u \\
2 & \text{if } s_i = f,\ s_{-i} = f \text{ and } \theta_i = u \\
1 & \text{if } s_i = l,\ s_{-i} = l \text{ and } \theta_i = u \\
0 & \text{if } s_i = f,\ s_{-i} = l \text{ and } \theta_i = u \\
3 & \text{if } s_i = f,\ s_{-i} = f \text{ and } \theta_i = b \\
2 & \text{if } s_i = l,\ s_{-i} = f \text{ and } \theta_i = b \\
1 & \text{if } s_i = l,\ s_{-i} = l \text{ and } \theta_i = b \\
0 & \text{if } s_i = f,\ s_{-i} = l \text{ and } \theta_i = b
\end{cases}$$




A strategy is a mapping $s_i(\theta_i): \{u, b\} \to \{l, f\}$. Note that a $u$ country always gets a higher payoff from $l$ regardless of the actions of the other country. Thus, if it were common knowledge that both countries were type $u$ then the game would be a Prisoners' Dilemma and each country would have a dominant strategy to choose $l$. Alternatively, if it were common knowledge that both countries were type $b$ then there would be two pure strategy Nash equilibria, $(f, f)$ and $(l, l)$.

In computing Bayesian Nash equilibria, it is often useful to make conjectures about equilibrium strategies and check to see if they satisfy the required conditions. Since we know that a type $u$ country has a dominant strategy of selecting $l$, every equilibrium will involve $s_i(u) = l$. Thus, the only possible symmetric pure strategy equilibria are $(s_i(u), s_i(b)) = (l, l)$ and $(l, f)$. We first investigate the possibility that $s_i(u) = l$ and $s_i(b) = f$ for both $i = 1, 2$. If country 2 uses this strategy, then $s_2 = l$ with probability $p$ and $s_2 = f$ with probability $(1-p)$. Thus, country 1's expected utility is

$$EU_1(s_1; \theta_1 = u) = \begin{cases} p + 3(1-p) & \text{if } s_1 = l \\ 2(1-p) & \text{if } s_1 = f \end{cases}$$

$$EU_1(s_1; \theta_1 = b) = \begin{cases} 2(1-p) + p & \text{if } s_1 = l \\ 3(1-p) & \text{if } s_1 = f. \end{cases}$$
When is the conjectured strategy a best response? The strategy $s_i(u) = l$ is a best response since type $u$ has a dominant strategy to erect trade barriers. Alternatively, $s_i(b) = f$ is a best response if

$$3(1-p) \ge 2(1-p) + p$$

which is true as long as $p \le \frac{1}{2}$. Since the calculations for country 2 are identical, we know that for $p \le \frac{1}{2}$ the profile $s_i(u) = l$ and $s_i(b) = f$ is a Bayesian Nash equilibrium.

Now let us check to see if $s_i(b) = l$ for both countries can be a best response. We note that if country 2 uses the strategy $s_2(\theta_2) = l$ regardless of $\theta_2$ then country 1 with type $b$ has the expected utility

$$EU_1(s_1; \theta_1 = b) = \begin{cases} 1 & \text{if } s_1 = l \\ 0 & \text{if } s_1 = f. \end{cases}$$


This means that regardless of $p$, the strategies $s_i(b) = l$, $s_i(u) = l$ for both countries are a Bayesian Nash equilibrium. So we have seen that there is always a Bayesian Nash equilibrium in which bilateral limits $(l, l)$ occur. Moreover, if $p \le \frac{1}{2}$ there is a second equilibrium with $s_i(u) = l$ and $s_i(b) = f$. In this equilibrium free trade occurs if both countries are of type $b$, which happens with probability $(1-p)^2$. So the possibility of bilateral free trade requires both that a country is not the unilateral type and that it expects its opponent is unlikely to be the unilateral type.
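The conjecture-and-check procedure used in this section can be sketched in a few lines of Python. This is a minimal illustration rather than a general solver: it only tests symmetric pure strategy profiles, the names `PAYOFF`, `expected_utility`, and `is_symmetric_bne` are hypothetical, and the three values of $p$ are arbitrary examples.

```python
from fractions import Fraction

# PAYOFF[own_type][(own_action, other_action)], taken from the payoff list above.
PAYOFF = {
    "u": {("l", "f"): 3, ("f", "f"): 2, ("l", "l"): 1, ("f", "l"): 0},
    "b": {("f", "f"): 3, ("l", "f"): 2, ("l", "l"): 1, ("f", "l"): 0},
}


def expected_utility(own_type, own_action, other_strategy, p):
    """Expected payoff when the opponent is type u with probability p."""
    prior = {"u": p, "b": 1 - p}
    return sum(
        prior[t] * PAYOFF[own_type][(own_action, other_strategy[t])]
        for t in ("u", "b")
    )


def is_symmetric_bne(strategy, p):
    """True if each type's prescribed action is a best response to `strategy`."""
    return all(
        expected_utility(t, strategy[t], strategy, p)
        >= expected_utility(t, a, strategy, p)
        for t in ("u", "b") for a in ("l", "f")
    )


separating = {"u": "l", "b": "f"}  # limit if unilateral type, free trade otherwise
pooling = {"u": "l", "b": "l"}     # always limit

for p in (Fraction(1, 4), Fraction(1, 2), Fraction(3, 4)):
    print(p, is_symmetric_bne(separating, p), is_symmetric_bne(pooling, p))
```

The separating profile passes the check only for $p \le \frac{1}{2}$, while the pooling profile passes for every $p$, in line with the algebra above.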


3. Application: Jury Voting 


As another example of Bayesian Nash equilibrium, we now consider a simple example of the jury model developed by Austen-Smith and Banks (1997). Suppose that three jurors $N = \{1, 2, 3\}$ are responsible for deciding whether to convict or acquit a defendant. Thus, they collectively must choose an outcome $x \in \{c, a\}$. The jurors simultaneously cast ballots $v_i \in S_i = \{c, a\}$ and the outcome is chosen by majority rule. Each player faces uncertainty about whether the defendant is guilty, $G$, or innocent, $I$. So the set of state variables can be denoted as $\Omega = \{G, I\}$. In the guilty state the jurors receive utility 1 from convicting and 0 from acquitting. Alternatively, in the innocent state the jurors receive utility 1 from acquitting and 0 from convicting. Each player assigns prior probability $\pi > \frac{1}{2}$ to the guilty state.

If we assumed that each of the jurors had identical information, each juror would receive an expected utility of $\pi$ from a guilty verdict and $1 - \pi$ from an acquittal. Since $\pi > 1 - \pi$, the Nash equilibrium that survives the elimination of weakly dominated strategies calls for each juror to vote guilty.

However, assume that before voting each player receives a private signal concerning the defendant's guilt, $\theta_i \in \{0, 1\}$. We assume that this signal is informative in the sense that a juror is more likely to receive the signal $\theta_i = 1$ when the defendant is guilty than she is when the defendant is innocent. Furthermore, to keep things very simple, we assume that the probability of receiving the guilty signal ($\theta_i = 1$) when the defendant is guilty is the same as that of receiving the innocent signal ($\theta_i = 0$) when the defendant is innocent. Formally, we assume that $\Pr(\theta_i = 1 \mid \omega = G) = \Pr(\theta_i = 0 \mid \omega = I) = p > \frac{1}{2}$, which obviously requires that $\Pr(\theta_i = 0 \mid \omega = G) = \Pr(\theta_i = 1 \mid \omega = I) = 1 - p$.

After receiving her signal, voter $i$ will select her vote $v_i(\theta_i)$ to maximize the probability that guilty defendants are convicted and innocent defendants are acquitted. Suppose that each voter uses a straightforward strategy $v_i(0) = a$ and $v_i(1) = c$ of voting to convict when she gets a signal of 1 and voting to acquit when she gets a signal of 0. To verify whether this strategy combination constitutes a Bayesian Nash equilibrium, we need to verify that voter 1 is willing to use this strategy if she conjectures that voters 2 and 3 are using this strategy. Given these conjectures, the expected utility of voting to convict is

$$\begin{aligned}
&\Pr(\theta_2 = 1 \text{ and } \theta_3 = 0 \text{ and } \omega = G \mid \theta_1) \\
+\ &\Pr(\theta_3 = 1 \text{ and } \theta_2 = 0 \text{ and } \omega = G \mid \theta_1) \\
+\ &\Pr(\theta_2 = 1 \text{ and } \theta_3 = 1 \text{ and } \omega = G \mid \theta_1) \\
+\ &\Pr(\theta_2 = 0 \text{ and } \theta_3 = 0 \text{ and } \omega = I \mid \theta_1)
\end{aligned}$$

while the expected utility of voting to acquit is

$$\begin{aligned}
&\Pr(\theta_2 = 1 \text{ and } \theta_3 = 0 \text{ and } \omega = I \mid \theta_1) \\
+\ &\Pr(\theta_3 = 1 \text{ and } \theta_2 = 0 \text{ and } \omega = I \mid \theta_1) \\
+\ &\Pr(\theta_2 = 0 \text{ and } \theta_3 = 0 \text{ and } \omega = I \mid \theta_1) \\
+\ &\Pr(\theta_2 = 1 \text{ and } \theta_3 = 1 \text{ and } \omega = G \mid \theta_1).
\end{aligned}$$


Note that the last two terms of each sum are the same and thus cancel out in comparing these two expected utilities. Accordingly voting to convict is a best response if

$$\begin{aligned}
&\Pr(\theta_2 = 1 \text{ and } \theta_3 = 0 \text{ and } \omega = G \mid \theta_1) + \Pr(\theta_3 = 1 \text{ and } \theta_2 = 0 \text{ and } \omega = G \mid \theta_1) \\
\ge\ &\Pr(\theta_2 = 1 \text{ and } \theta_3 = 0 \text{ and } \omega = I \mid \theta_1) + \Pr(\theta_3 = 1 \text{ and } \theta_2 = 0 \text{ and } \omega = I \mid \theta_1)
\end{aligned}$$

while voting to acquit is a best response if the reverse weak inequality holds. Since these expressions depend on conditional probabilities of observing combinations of the state variable and the signals of the remaining jurors, juror 1 must use Bayes' Rule to evaluate each term.
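Suppose that juror 1 receives $\theta_1 = 1$. It can be easily shown that

$$\Pr(\theta_2 = 1 \text{ and } \theta_3 = 0 \text{ and } \omega = G \mid \theta_1 = 1) = \Pr(\theta_3 = 1 \text{ and } \theta_2 = 0 \text{ and } \omega = G \mid \theta_1 = 1) = \frac{\pi p^2 (1-p)}{\pi p + (1-\pi)(1-p)}$$

and

$$\Pr(\theta_2 = 1 \text{ and } \theta_3 = 0 \text{ and } \omega = I \mid \theta_1 = 1) = \Pr(\theta_3 = 1 \text{ and } \theta_2 = 0 \text{ and } \omega = I \mid \theta_1 = 1) = \frac{(1-\pi) p (1-p)^2}{\pi p + (1-\pi)(1-p)}.$$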


Thus, juror 1 will choose $v_1(1) = c$ if

$$\frac{\pi p^2 (1-p)}{\pi p + (1-\pi)(1-p)} \ge \frac{(1-\pi) p (1-p)^2}{\pi p + (1-\pi)(1-p)}$$

or after simplifying

$$\pi p (1-p) p \ge (1-\pi) p (1-p)(1-p).$$

We can rearrange this expression to

$$\frac{\pi p^2 (1-p)}{\pi p^2 (1-p) + (1-\pi) p (1-p)^2} \ge \frac{1}{2}.$$

The left hand side is just the conditional probability of guilt given two signals of 1 and one signal of 0. In other words agent 1 is willing to vote to convict if she thinks that the defendant is more likely to be guilty than innocent when she conditions on her signal (one of the 1 signals) and the assumption that she is pivotal so that the remaining two agents have received different signals. Similarly, we can express the requirement for a vote of innocence conditional on a signal of 0 as

$$\frac{\pi p (1-p)^2}{\pi p (1-p)^2 + (1-\pi) p^2 (1-p)} \le \frac{1}{2}.$$

So the conclusion is that in a Bayesian equilibrium to this game a voter must vote for the action which she prefers in the state she believes is more likely given that she conditions on both her own signal and the profile of other signals that must occur for her to be pivotal. Austen-Smith and Banks note that in many cases this simple strategy (voting to convict if $\theta_i = 1$ and voting to acquit if $\theta_i = 0$) is not equilibrium behavior. In this example this point can be demonstrated by choosing parameters $\pi$ and $p$ for which one of the last two inequalities does not hold. Of course alternative types of strategies might be played. Voters can randomize for some types, or they may choose to vote the same way regardless of their type, or different voters may use different strategies. Feddersen and Pesendorfer (1998) consider the properties of equilibria to this game as one varies the voting rule and population size.
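The two pivotal conditions are easy to test for particular parameter values. The following Python sketch is a minimal illustration for the three-juror, majority-rule case: the function names are hypothetical and the values of $\pi$ and $p$ are arbitrary examples, not taken from the text.

```python
from fractions import Fraction


def posterior_guilt(pi, p, ones, zeros):
    """Pr(G | `ones` signals equal to 1 and `zeros` signals equal to 0)."""
    guilty = pi * p ** ones * (1 - p) ** zeros
    innocent = (1 - pi) * (1 - p) ** ones * p ** zeros
    return guilty / (guilty + innocent)


def sincere_voting_is_bne(pi, p):
    """Check both pivotal conditions for three jurors under majority rule.

    A pivotal juror knows the other two signals are split (one 1 and one 0),
    so she conditions on her own signal plus one signal of each kind.
    """
    convict_ok = posterior_guilt(pi, p, ones=2, zeros=1) >= Fraction(1, 2)
    acquit_ok = posterior_guilt(pi, p, ones=1, zeros=2) <= Fraction(1, 2)
    return convict_ok and acquit_ok


# Illustrative parameter values.
print(sincere_voting_is_bne(Fraction(3, 5), Fraction(7, 10)))
print(sincere_voting_is_bne(Fraction(4, 5), Fraction(7, 10)))
```

In this three-juror case the second inequality simplifies to $\pi \le p$, so sincere voting fails whenever the prior is more informative than a single signal, as in the second call.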


4. Application: Jury Voting with a Continuum of Signals* 


One extension of the jury model involves a larger type space.² Suppose instead of receiving a binary signal, each juror receives a signal $\theta_i \in [0, 1]$. One way to model the case of an informative signal taking on a continuum of possible values is to assume that $\theta_i$ is drawn from a state conditional distribution $F(\theta_i \mid \omega)$ with a differentiable density function $f(\theta_i \mid \omega)$ that satisfies the monotone likelihood ratio condition.


DEFINITION 6.2. The conditional densities satisfy the strict monotone likelihood ratio condition (SMLR) if $\frac{f(\theta_i \mid G)}{f(\theta_i \mid I)}$ is a strictly monotone function of $\theta_i$ on $[0, 1]$.


² Duggan and Martinelli (2001) and Meirowitz (2002) consider this extension.




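For convenience, we assume that $\frac{f(\theta_i \mid G)}{f(\theta_i \mid I)}$ is strictly increasing. To see why this assumption is important, note that Bayes' rule implies that

$$\Pr(G \mid \theta_i) = \frac{f(\theta_i \mid G)\pi}{f(\theta_i \mid G)\pi + f(\theta_i \mid I)(1-\pi)} = \frac{\frac{f(\theta_i \mid G)}{f(\theta_i \mid I)}\pi}{\frac{f(\theta_i \mid G)}{f(\theta_i \mid I)}\pi + (1-\pi)}.$$

It can easily be verified that $\Pr(G \mid \theta_i)$ is increasing in $\theta_i$ if and only if $\frac{f(\theta_i \mid G)}{f(\theta_i \mid I)}$ is increasing in $\theta_i$. Thus, the SMLR condition implies that the higher the signal agent $i$ receives the higher her posterior belief about the probability that the state is $\omega = G$.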

To keep things simple, we will focus exclusively on symmetric strategies where voters who receive the same signal choose the same strategy. Thus, a symmetric strategy profile is characterized by a mapping $v(\theta_i): [0, 1] \to \{c, a\}$. Following the logic from above, a Bayesian Nash equilibrium must involve a strategy that is optimal when the agent conditions on her private information and the conjecture that she is pivotal. An agent will vote to convict if she thinks the probability of guilt is no less than $\frac{1}{2}$ and she will vote to acquit if she thinks the probability of guilt is no more than $\frac{1}{2}$. Given that higher signals are better indicators of guilt, one natural conjecture is that the strategy must be weakly increasing. For low values of $\theta_i$ an acquittal vote is cast and for high values of $\theta_i$ a conviction vote is cast. Let $\tilde\theta \in [0, 1]$ denote a cutpoint and let us assume that agents $j \in N \setminus \{i\}$ use the strategy

$$v(\theta_j) = \begin{cases} c & \text{if } \theta_j \ge \tilde\theta \\ a & \text{if } \theta_j < \tilde\theta. \end{cases}$$


Suppose that the jury decision rule is a $q$-rule, requiring at least $q > \frac{n}{2}$ votes to convict. Thus, if players $N \setminus \{i\}$ use the cutpoint strategy then the posterior probability of $\{\omega = G\}$ given $\theta_i$ and that $i$ is pivotal is given by

(6.2)  $$\mathrm{pr}(G \mid piv, \theta_i; \tilde\theta) = \frac{\pi f_G F_G^{\,n-q}\left[1 - F_G\right]^{q-1}}{\pi f_G F_G^{\,n-q}\left[1 - F_G\right]^{q-1} + (1-\pi) f_I F_I^{\,n-q}\left[1 - F_I\right]^{q-1}}$$

where $f_\omega = f(\theta_i \mid \omega)$ and $F_\omega = F(\tilde\theta \mid \omega)$. We leave the derivation of this expression as an exercise. This probability is a function of the parameter $\tilde\theta$ and we make this point explicit in the left hand side. In this model the existence of a symmetric equilibrium in which voters use a cutpoint hinges on finding a value of $\tilde\theta$ such that

$$\mathrm{pr}(G \mid piv, \tilde\theta; \tilde\theta) = \frac{1}{2}$$

and demonstrating that if $\theta_i < \tilde\theta$ then $\mathrm{pr}(G \mid piv, \theta_i; \tilde\theta) < \frac{1}{2}$ and if $\theta_i > \tilde\theta$ then $\mathrm{pr}(G \mid piv, \theta_i; \tilde\theta) > \frac{1}{2}$. While analysis of examples is cumbersome, it is easy to come up with conditions on the primitives of the game to ensure that such a $\tilde\theta \in (0, 1)$ exists. First, since $\mathrm{pr}(G \mid piv, \theta_i; \tilde\theta) \ge \frac{1}{2}$ if and only if

$$\frac{f(\theta_i \mid G)}{f(\theta_i \mid I)} \ge \frac{(1-\pi) F(\tilde\theta \mid I)^{n-q}\left[1 - F(\tilde\theta \mid I)\right]^{q-1}}{\pi F(\tilde\theta \mid G)^{n-q}\left[1 - F(\tilde\theta \mid G)\right]^{q-1}}$$

the strict monotone likelihood ratio condition implies that if $\mathrm{pr}(G \mid piv, \tilde\theta; \tilde\theta) = \frac{1}{2}$ then $\theta_i < \tilde\theta$ implies $\mathrm{pr}(G \mid piv, \theta_i; \tilde\theta) < \frac{1}{2}$ and $\theta_i > \tilde\theta$ implies $\mathrm{pr}(G \mid piv, \theta_i; \tilde\theta) > \frac{1}{2}$. This means that existence of an equilibrium hinges on establishing the existence of a solution to the equation

$$\mathrm{pr}(G \mid piv, \tilde\theta; \tilde\theta) = \frac{1}{2}.$$

If $\mathrm{pr}(G \mid piv, 0; 0) < \frac{1}{2} < \mathrm{pr}(G \mid piv, 1; 1)$ then the intermediate value theorem implies that such a cutpoint will exist since the function $\mathrm{pr}(G \mid piv, \cdot; \cdot)$ is continuous. For a large class of games these boundary conditions will be satisfied.
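To make the existence argument concrete, the cutpoint can be computed for a specific pair of densities. The following Python sketch is illustrative only: the densities $f(\theta \mid G) = 2\theta$ and $f(\theta \mid I) = 2(1-\theta)$ are an assumption chosen because their ratio $\frac{\theta}{1-\theta}$ is strictly increasing (so SMLR holds), and the function names are hypothetical. It bisects on the posterior in expression (6.2) evaluated at $\theta_i = \tilde\theta$.

```python
def pivotal_posterior(theta_i, cut, pi, n, q):
    """pr(G | piv, theta_i; cut) for the assumed densities 2*theta and 2*(1 - theta)."""
    f_g, f_i = 2 * theta_i, 2 * (1 - theta_i)   # densities at the juror's own signal
    F_g, F_i = cut ** 2, 1 - (1 - cut) ** 2     # CDFs evaluated at the cutpoint
    guilty = pi * f_g * F_g ** (n - q) * (1 - F_g) ** (q - 1)
    innocent = (1 - pi) * f_i * F_i ** (n - q) * (1 - F_i) ** (q - 1)
    return guilty / (guilty + innocent)


def equilibrium_cutpoint(pi, n, q, tol=1e-10):
    """Bisect on pr(G | piv, t; t) - 1/2 over the interior of [0, 1].

    For the assumed densities this posterior is increasing in t, so the
    crossing point is unique.
    """
    lo, hi = 1e-6, 1 - 1e-6
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if pivotal_posterior(mid, mid, pi, n, q) < 0.5:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


# Three jurors under majority rule (q = 2), with two illustrative priors.
print(round(equilibrium_cutpoint(0.5, 3, 2), 4))
print(round(equilibrium_cutpoint(0.6, 3, 2), 4))
```

With a neutral prior the symmetry of the assumed densities puts the cutpoint at one half, and a prior tilted toward guilt lowers the signal needed to vote for conviction.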

So while the simple binary type model demonstrates that equilibria where everyone uses the same rule and voting is determined by private information need not exist, equilibria of this type generally exist in the continuum model. In the case of unanimity rule, the modeling technology is consequential for the conclusions that can be reached about the desirability of particular political institutions. Using the binary model, Feddersen and Pesendorfer (1998) show that unanimity rule is a uniquely poor way to aggregate information for large populations because in equilibrium voters condition on the assumption that everyone else is voting to convict. In the continuum model (Meirowitz 2002) unanimity rule turns out to be as good as the other voting rules in some cases.


5. Application: Public Goods and Incomplete Information 


In this section, we consider a version of the Palfrey-Rosenthal contribution game where potential contributors are uncertain about the contribution costs of other players. To keep the model as close as possible to the one we have already analyzed, we assume that every agent receives a utility of 1 if at least $k$ agents contribute and 0 otherwise. Agent $i$ pays a cost $c_i$ to contribute where we assume that $c_i$ is distributed uniformly on $[0, 1]$. Each agent learns her own cost, but remains uncertain about the costs of others.


5.1. k = 1. First we consider the case where a single contribution is necessary for the provision of the good. Since $c_i \le 1$ with certainty for all contributors, there are always $n$ Bayesian Nash equilibria, each corresponding to one agent $i$ contributing with certainty. However, as we did before, we will concentrate on symmetric equilibria where all player types with the same cost play the same strategy. We therefore focus on equilibria in cutpoint strategies. In such equilibria, agent $i$ contributes if and only if $c_i \le \bar{c}$. If we assume that all players other than $i$ choose a cutpoint strategy, agent $i$'s utility from contributing is $1 - c_i$. To compute her utility from not contributing, note that she receives 1 as long as there is at least one contributor. Since each cost is distributed uniformly on $[0, 1]$, each other agent contributes with probability $\bar{c}$ so that the probability of no contributions is $[1 - \bar{c}]^{n-1}$. Thus, agent $i$'s utility from not contributing is $1 - [1 - \bar{c}]^{n-1}$. Thus agent $i$ contributes so long as

$$c_i \le [1 - \bar{c}]^{n-1}.$$

We can use this expression to solve implicitly for $\bar{c}$ since agent $i$ must be indifferent at the cutpoint. Thus, we have $\bar{c} = [1 - \bar{c}]^{n-1}$, or equivalently $\bar{c}^{\frac{1}{n-1}} + \bar{c} = 1$. It is very easy to show that $\bar{c}$ is declining in $n$ and goes to zero for very large $n$. In turn, this means that the probability that any individual will contribute goes to zero as the group expands. Thus, the incomplete information version also predicts that there will be more free-riding in large groups.
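The implicit condition for the cutpoint has no closed form for general $n$, but it is simple to solve numerically. The following Python sketch is a minimal illustration; the function name `cutpoint_k1` and the group sizes in the loop are hypothetical choices.

```python
def cutpoint_k1(n, tol=1e-12):
    """Solve c = (1 - c)**(n - 1) on [0, 1] by bisection, for n >= 2."""
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = (lo + hi) / 2
        # mid - (1 - mid)**(n - 1) is increasing in mid, negative at 0 and
        # positive at 1, so the root is unique.
        if mid < (1 - mid) ** (n - 1):
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


for n in (2, 3, 10, 100):
    print(n, round(cutpoint_k1(n), 4))
```

With two agents the cutpoint is exactly one half, and the printed values shrink as $n$ grows, which is the free-riding effect described above.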


5.2. k > 1. Now we turn to the case where multiple contributions are required for the provision of the good. We again assume that agents use cutpoint strategies and contribute only if $c_i \le \bar{c}$.

Let $x_{-i}$ be the realized number of contributions from agents other than $i$. From arguments identical to those of the last chapter, we know that agent $i$'s net utility from contributing is

$$\Pr(x_{-i} = k - 1) - c_i.$$

Since contribution probabilities are $\bar{c}$, we can also show that

$$\Pr(x_{-i} = k - 1) = \binom{n-1}{k-1}\bar{c}^{\,k-1}(1 - \bar{c})^{n-k}.$$

Since agent $i$ must be indifferent at the cutpoint cost, we again get an implicit solution for $\bar{c}$:

$$\binom{n-1}{k-1}\bar{c}^{\,k-2}(1 - \bar{c})^{n-k} = 1.$$

Note that this solution is very similar to that of the mixed strategy equilibrium with complete information. The main difference is that $\bar{c}$ plays the role of $\sigma^*$. Thus, many of the implications of our previous analysis carry over.

To reduce notation let $\Pi(\bar{c}) = \binom{n-1}{k-1}\bar{c}^{\,k-2}(1 - \bar{c})^{n-k}$ so that our equilibrium condition is $\Pi(\bar{c}) = 1$. First note that

$$\Pi'(\bar{c}) = 0 \quad \text{if } \bar{c} = \frac{k-2}{n-2}.$$

Thus, so long as $2 < k < n$, $\Pi(\bar{c})$ is single peaked for $\bar{c} \in [0, 1]$. Thus, if $2 < k < n$ and $\max \Pi > 1$, there will be two equilibrium cutpoints $\bar{c}_l$ and $\bar{c}_h$, just as there were two symmetric equilibria in mixed strategies. If $\max \Pi < 1$, there will be no cutpoint equilibria.³
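Whether a given pair $(n, k)$ has two cutpoint equilibria or none can be checked by evaluating $\Pi$ at its peak and then bisecting on each side. The following Python sketch is illustrative only: the function names are hypothetical, the two parameter pairs are arbitrary examples, and the code assumes $2 < k < n$ so that $\Pi$ is single peaked.

```python
from math import comb


def pi_fn(c, n, k):
    """Pi(c) = C(n-1, k-1) * c**(k-2) * (1 - c)**(n-k)."""
    return comb(n - 1, k - 1) * c ** (k - 2) * (1 - c) ** (n - k)


def cutpoint_equilibria(n, k, tol=1e-12):
    """Return (c_l, c_h) with Pi(c) = 1, or None if max Pi < 1. Assumes 2 < k < n."""
    peak = (k - 2) / (n - 2)
    if pi_fn(peak, n, k) < 1:
        return None

    def root(lo, hi):
        # Keep the endpoint on the same side of 1 as the starting lower bound.
        below_at_lo = pi_fn(lo, n, k) < 1
        while hi - lo > tol:
            mid = (lo + hi) / 2
            if (pi_fn(mid, n, k) < 1) == below_at_lo:
                lo = mid
            else:
                hi = mid
        return (lo + hi) / 2

    return root(0.0, peak), root(peak, 1.0)


print(cutpoint_equilibria(6, 3))  # max Pi exceeds 1: two cutpoints
print(cutpoint_equilibria(5, 3))  # max Pi = 8/9: no cutpoint equilibrium
```

In the first case the two cutpoints lie on either side of the peak at $\frac{k-2}{n-2} = \frac{1}{4}$.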

As before, it is easy to demonstrate that the cutpoint equilibria disappear as $n$ gets very large. Note that the $\bar{c}$ that maximizes $\Pi$ is $\frac{k-2}{n-2}$. Thus, as $n$ gets large, this maximizer goes to zero. Since $\Pi(0) = 0$, $\lim_{n \to \infty}[\max \Pi] = 0$. Thus for sufficiently large $n$, $\max \Pi < 1$. We leave it to the readers to verify that the effects of $n$ and $k$ on $\bar{c}_l$ and $\bar{c}_h$ are essentially the same as the effects of $n$ and $k$ on $\sigma_l^*$ and $\sigma_h^*$ in the symmetric mixed strategy equilibrium of the previous chapter.

³ In the unlikely event that $\max \Pi = 1$, there will be a unique cutpoint equilibrium.


6. Application: Electoral Competition under Uncertainty


We now return to the Hotelling model of candidate competition with policy motivated candidates and consider two extensions. We first assume that candidate preferences are known (candidates have ideal points of 0 and 1 as before) but that instead of knowing that the voters are arranged uniformly on the unit interval, the candidates believe that the median voter's location is randomly drawn from the uniform distribution on $[0, 1]$. This model is an example of models treated by Wittman (1977) and Calvert (1985). To formulate this model as a Bayesian game, we let $\Omega = [0, 1]$ denote the set of possible locations of the median voter. Since candidate preferences are common knowledge in this example the type spaces are singletons. So all of the uncertainty in the game is captured by the probability distribution $F(\omega) = \omega$ on $[0, 1]$. Here, we assume that candidate preferences over policy are quadratic so that $u_1(x) = -x^2$ and $u_2(x) = -(1-x)^2$. Given two platforms $s_1 < s_2$ candidate 1 wins if the median is closer to $s_1$ than $s_2$. This is true if the median is less than $\frac{s_1 + s_2}{2}$. Since the median is uniformly distributed, candidate 1 wins with probability $\frac{s_1 + s_2}{2}$. In this case candidate expected utilities are

$$EU_1(s_1, s_2) = \begin{cases}
-s_1^2 \frac{s_1 + s_2}{2} - s_2^2\left(1 - \frac{s_1 + s_2}{2}\right) & \text{if } s_1 < s_2 \\
-s_2^2 \frac{s_1 + s_2}{2} - s_1^2\left(1 - \frac{s_1 + s_2}{2}\right) & \text{if } s_1 > s_2
\end{cases}$$

and

$$EU_2(s_1, s_2) = \begin{cases}
-(1 - s_1)^2 \frac{s_1 + s_2}{2} - (1 - s_2)^2\left(1 - \frac{s_1 + s_2}{2}\right) & \text{if } s_1 < s_2 \\
-(1 - s_2)^2 \frac{s_1 + s_2}{2} - (1 - s_1)^2\left(1 - \frac{s_1 + s_2}{2}\right) & \text{if } s_1 > s_2.
\end{cases}$$
To construct an equilibrium, suppose that candidate 1 knows that candidate 2 
will locate at $z > \frac{1}{2}$. Then candidate 1 chooses $s_1 \in [0, z]$ to 
optimize 

$$-s_1^2\,\frac{s_1+z}{2} - z^2\left(1 - \frac{s_1+z}{2}\right).$$

Note that we can ignore the possibility of choosing $s_1 > z$ because this 
strategy is always dominated by $s_1 = z$.⁴ To find the optimal choice 
of $s_1$, we can differentiate the objective function with respect to $s_1$ and 
set this derivative to 0. This first order condition is 

$$-\frac{3}{2}s_1^2 - z s_1 + \frac{z^2}{2} = 0.$$

Solving for $s_1$ yields two solutions, $s_1 = \frac{z}{3}$ and $s_1 = -z$, but only one 
is in the appropriate range [0,1]. This solution then gives us the best 
response function 

$$s_1(s_2) = \frac{s_2}{3}.$$


To be sure that this solution characterizes a local maximum (as opposed 
to a local minimum or saddle point) we need to check that the second 
derivative of the objective function is negative when evaluated at this 
value. This second order condition simplifies to $-2s_2$ and is negative 
for any value of $s_2 \in (0, 1]$. 

4 Indeed, if candidate 1 chose $s_1 > z$, she would prefer candidate 2 to win. 

Similarly, treating $s_1$ as a fixed parameter $z < \frac{1}{2}$ we can differentiate 
candidate 2’s objective function 

$$\max_{s_2} \; -(1-z)^2\,\frac{z+s_2}{2} - (1-s_2)^2\left(1 - \frac{z+s_2}{2}\right)$$

to find an optimal $s_2 \in [z, 1]$. The solution is 

$$s_2(s_1) = \frac{2}{3} + \frac{1}{3}s_1.$$

We leave verification of the second order condition to the reader. A 
Bayesian Nash equilibrium is then a strategy combination $(s_1^*, s_2^*)$ that 
solves the system 

$$s_1^* = \frac{1}{3}s_2^*$$
$$s_2^* = \frac{2}{3} + \frac{1}{3}s_1^*.$$

The unique solution to this system is 

$$s_1^* = \frac{1}{4}, \quad s_2^* = \frac{3}{4}.$$


So with policy motivated candidates and uncertainty about voter pref- 
erences candidate divergence is predicted. 
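
The best response argument can be checked numerically. The following Python sketch is an illustration rather than part of the original analysis: the function names and the grid resolution are assumptions introduced here. It iterates the two closed-form best responses to their fixed point and uses a brute-force grid search to confirm that candidate 1's expected utility against a fixed $s_2$ is maximized near $s_2/3$.

```python
# Sketch: candidate competition with an uncertain median voter.
# Model assumptions taken from the text: median ~ U[0,1],
# u_1(x) = -x^2, u_2(x) = -(1-x)^2, and platforms s1 <= s2.

def eu1(s1: float, s2: float) -> float:
    """Expected utility of candidate 1 when s1 <= s2."""
    p1 = (s1 + s2) / 2                      # probability that candidate 1 wins
    return -s1**2 * p1 - s2**2 * (1 - p1)

def best_response_1(s2: float) -> float:
    """Closed-form best response s_1(s_2) = s_2 / 3."""
    return s2 / 3

def best_response_2(s1: float) -> float:
    """Closed-form best response s_2(s_1) = 2/3 + s_1 / 3."""
    return 2 / 3 + s1 / 3

def solve_equilibrium(iterations: int = 100) -> tuple:
    """Iterate both best responses; the map is a contraction, so it converges."""
    s1, s2 = 0.0, 1.0
    for _ in range(iterations):
        s1, s2 = best_response_1(s2), best_response_2(s1)
    return s1, s2

def grid_best_response_1(s2: float, steps: int = 10_000) -> float:
    """Brute-force maximizer of eu1 over an evenly spaced grid on [0, s2]."""
    grid = [s2 * k / steps for k in range(steps + 1)]
    return max(grid, key=lambda s1: eu1(s1, s2))

if __name__ == "__main__":
    print(solve_equilibrium())          # fixed point of the two best responses
    print(grid_best_response_1(0.75))   # grid maximizer against s2 = 3/4
```

The grid search is deliberately crude; it only serves as an independent check that does not rely on the first order condition.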


6.1. Private Information about Candidate Preferences. Now 
we consider a model where, in addition to uncertainty about the location 
of the median voter, candidates have private information about 
their policy preferences. One simple example involves candidate 1 
having ideal point $\theta_1 \in \{0, \frac{1}{2}\}$ and candidate 2 having ideal point 
$\theta_2 \in \{\frac{1}{2}, 1\}$. So the utility to candidate $i$ of policy location 
$x$ is $u_i(x) = -(\theta_i - x)^2$. For simplicity we assume that each type is 
drawn with equal probability and that the draws of types across candidates 
are independent. As before we assume that the median voter’s ideal 
point is randomly drawn from a uniform distribution over [0,1]. In 
this case a strategy for candidate 1 is a mapping $s_1(\theta_1) : \{0, \frac{1}{2}\} \to [0, \frac{1}{2}]$ 
and a strategy for candidate 2 is a mapping $s_2(\theta_2) : \{\frac{1}{2}, 1\} \to [\frac{1}{2}, 1]$. 
For simplicity, we ignore the possibility of a candidate selecting a policy 
that is further from her ideal point than the value $\frac{1}{2}$. One justification 
for this assumption might be that parties constrain candidates from 
crossing over into the far side of the ideological spectrum. In this case 
suppose that candidate 2 uses the strategy $s_2(\frac{1}{2}) = a$ and $s_2(1) = b$. 
Now, in considering the optimal location for a candidate 1 with type 
$\theta_1 = \frac{1}{2}$, note that any location $s_1 < \frac{1}{2}$ is dominated by the location $\frac{1}{2}$. 
This is true because for fixed $a$, $b$ the location $\frac{1}{2}$ will win with probability 
strictly higher than the probability that a location of $s_1 < \frac{1}{2}$ wins. 
Moreover, conditional on winning, a location of $s_1 < \frac{1}{2}$ is less desirable 
to candidate 1 with type $\theta_1 = \frac{1}{2}$ than a location of $\frac{1}{2}$. Accordingly, 
we know that $s_1(\frac{1}{2}) = \frac{1}{2}$ is strictly dominant. Similarly, $s_2(\frac{1}{2}) = \frac{1}{2}$ is 
strictly dominant. Now to find the best response for candidate 1 of 
type $\theta_1 = 0$ we need to solve an optimization problem. If $\theta_1 = 0$ then 
candidate 1’s objective function is 

$$-s_1^2\left[\frac{1}{2}\cdot\frac{s_1+\frac{1}{2}}{2} + \frac{1}{2}\cdot\frac{s_1+b}{2}\right] - \frac{1}{2}\left(\frac{1}{2}\right)^2\left(1 - \frac{s_1+\frac{1}{2}}{2}\right) - \frac{1}{2}\,b^2\left(1 - \frac{s_1+b}{2}\right).$$


Differentiating with respect to $s_1$ and setting this term equal to 0 yields 
the first order condition 

$$-\frac{3}{2}s_1^2 - \left(\frac{1}{4} + \frac{b}{2}\right)s_1 + \frac{b^2}{4} + \frac{1}{16} = 0.$$

The solution (in the appropriate range) is 

$$s_1(0; b) = \frac{1}{12}\sqrt{4b + 28b^2 + 7} - \frac{1}{6}b - \frac{1}{12}.$$

Now instead of solving candidate 2’s problem in an analogous manner, 
we can notice that given $s_1(\frac{1}{2}) = \frac{1}{2}$ and a fixed location $s_1(0)$, candidate 2’s problem 
is the mirror image of candidate 1’s. This means that we can find the 
equilibrium values of $s_2(1) = b$ and $s_1(0) = 1 - b$ that solve the relevant 
first order conditions by solving for $b$ that satisfies the equality 

$$1 - b = \frac{1}{12}\sqrt{4b + 28b^2 + 7} - \frac{1}{6}b - \frac{1}{12}.$$

The solution is $b = \frac{11}{6} - \frac{1}{3}\sqrt{10} \approx 0.77924$. Thus, the Bayesian Nash 
equilibrium is $s_1(0) = 0.221$, $s_1(\frac{1}{2}) = \frac{1}{2}$, $s_2(\frac{1}{2}) = \frac{1}{2}$, $s_2(1) = 0.779$. It is 
instructive to compare the platforms of $\theta_1 = 0$ and $\theta_2 = 1$ with the outcomes 
of the game where candidate preferences are known. One might 
expect that the platforms would be more convergent given that each 
candidate thinks that they might be playing against a moderate candidate. 
This intuition is misleading, though, since we observe platforms 
in the candidate uncertainty game that are even more divergent. The 
rationale for the seemingly anomalous outcome is that each extreme 
candidate type knows that they will lose with certainty against a moderate 
type unless they locate at the expected median. However, 
they are indifferent between winning and losing if both candidates locate at 
$\frac{1}{2}$. So all the candidate uncertainty does is make the election outcome 
more random. This additional randomness mitigates the penalty for 
taking extreme positions. Therefore, the candidates polarize. 
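
The symmetric equilibrium of this game can also be approximated without any calculus. The sketch below is a hedged illustration: it assumes the model exactly as described above (median uniform on [0,1], moderate types located at $\frac{1}{2}$, equally likely types), and the function names, the ternary search, and the starting value are choices made here, not part of the original analysis. It maximizes the extreme type's expected utility directly and iterates the mirror-image condition $s_1(0) = 1 - b$; the printed platforms can be compared with the closed-form solution in the text.

```python
# Sketch: symmetric equilibrium when candidate ideal points are private information.
# Assumptions from the text: median ~ U[0,1]; moderate types locate at 1/2;
# candidate 2's extreme type locates at b; each type has probability 1/2.

def eu_type0(s1: float, b: float) -> float:
    """Expected utility of candidate 1 with ideal point 0 at platform s1 <= 1/2."""
    total = 0.0
    for s2 in (0.5, b):                     # opponent's two equally likely platforms
        p_win = (s1 + s2) / 2               # probability that candidate 1 wins
        total += 0.5 * (-s1**2 * p_win - s2**2 * (1 - p_win))
    return total

def best_response(b: float, iterations: int = 100) -> float:
    """Ternary search on [0, 1/2]; valid because eu_type0 is concave in s1 there."""
    lo, hi = 0.0, 0.5
    for _ in range(iterations):
        m1, m2 = lo + (hi - lo) / 3, hi - (hi - lo) / 3
        if eu_type0(m1, b) < eu_type0(m2, b):
            lo = m1
        else:
            hi = m2
    return (lo + hi) / 2

def symmetric_equilibrium(b: float = 0.75, iterations: int = 100) -> float:
    """Iterate b -> 1 - best_response(b), the mirror-image condition s_1(0) = 1 - b."""
    for _ in range(iterations):
        b = 1 - best_response(b)
    return b

if __name__ == "__main__":
    b = symmetric_equilibrium()
    print(f"s_1(0) = {1 - b:.5f}, s_2(1) = {b:.5f}")
```

Because the search works on the expected utility itself, it provides a check on the algebra of the first order condition rather than a restatement of it.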


7. Application: Campaigns, Contests and Auctions* 


An alternative perspective on electoral competition frames the competition 
as a contest. Candidates each select a level of costly effort, 
and a winner is chosen. More effort increases the likelihood that one 
wins. This approach allows us to focus on the role of money in campaigns. 
The model that we present here shares many features with 
models of auctions, a topic we take up in some detail in Chapter 11. 
Consider a set $N = \{1,...,n\}$ of candidates that are running for office. 
Candidates compete by raising money and spending it on advertisements. 
Let $a_i \in \mathbb{R}_+$ denote the level of fundraising by candidate $i$. 
Given the accumulations $a = (a_1,...,a_n)$, the winner is determined by 
the function $p(a) : \mathbb{R}_+^n \to N$. This function should be weakly increasing. 
A reasonable example includes the mapping $p(a) = \arg\max_{j \in N} a_j$ 
which awards the office to the candidate that raises the most money.⁵ 
Candidate $i$’s utility depends on the identity of the winner, the level of 
accumulation $a_i$ and the candidate’s value of winning office, $\theta_i \in [0, 1]$. 
For simplicity we assume that the values to winning office (types) of 
each candidate are private information independently drawn from a 
uniform distribution on [0,1]. Specifically, candidate $i$’s utility takes 
the form 

$$u_i(a) = \theta_i 1_{\{p(a)=i\}} - a_i$$

where $1_{\{p(a)=i\}}$ is an indicator function that takes the value 1 if $p(a) = i$ 
and 0 otherwise. In this Bayesian game each candidate simultaneously 
selects their level of $a_i$ and then the payoffs are realized. In the 
language of auction theory, this is a first price all-pay auction with 
independent types. A Bayesian Nash equilibrium is a function (for 
each candidate) that maps $\theta_i$ into $a_i$. Once again, we focus on symmetric 
equilibria in which each candidate uses the same strategy. 

5 An alternative interpretation of this model is to treat $a_i$ as the level of effort 
or time that the candidate spends running for office. 

Directly solving the model for continuous strategy functions is often 
quite difficult. A trick is to assume that the strategy functions have a 
specific functional form, solve for any free parameters, and verify that 
the solutions constitute equilibria. Here we conjecture that players 
$j \neq i$ use a strategy of the form $a_j(\theta_j) = b\theta_j^c$ where $b$ and $c$ are 
parameters to be determined. If players $2,...,n$ use the conjectured 
strategy, and candidate 1 selects $a_1$, then the probability that 1 wins is 
$\Pr\{\max_{j \neq 1} b\theta_j^c < a_1\}$. This probability is 

$$\Pr\left\{\max_{j \neq 1} \theta_j < \left(\frac{a_1}{b}\right)^{\frac{1}{c}}\right\} = \left(\frac{a_1}{b}\right)^{\frac{n-1}{c}}.$$

Accordingly the expected utility to player 1 with type $\theta_1$ from action 
$a_1$ is 

$$\left(\frac{a_1}{b}\right)^{\frac{n-1}{c}}\theta_1 - a_1.$$
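Differentiating with respect to $a_1$ yields the first order condition 

$$\frac{(n-1)\theta_1}{cb}\left(\frac{a_1}{b}\right)^{\frac{n-1-c}{c}} - 1 = 0$$

and solving for $a_1$ yields 

$$a_1 = b\left(\frac{(n-1)\theta_1}{cb}\right)^{\frac{c}{c-n+1}}. \tag{6.3}$$

We began by conjecturing that players $j = 2,...,n$ used a strategy of 
the form $a_j(\theta_j) = b\theta_j^c$ and found that player 1’s best response was to 
use a strategy of the form given above. An equilibrium can then be 
found by solving for values of $b$ and $c$ such that 

$$b\theta_1^c = b\left(\frac{(n-1)\theta_1}{cb}\right)^{\frac{c}{c-n+1}}. \tag{6.4}$$

Suppose that $b = \frac{n-1}{n}$ and $c = n$; then substitution into the right hand 
side yields 

$$\frac{n-1}{n}\left(\frac{(n-1)\theta_1}{n \cdot \frac{n-1}{n}}\right)^{n}. \tag{6.5}$$

This simplifies to 

$$\frac{n-1}{n}\theta_1^n,$$

confirming that $b = \frac{n-1}{n}$ and $c = n$ correspond to an equilibrium. 
Thus, a symmetric Bayesian Nash equilibrium is for each candidate to 
accumulate 

$$a_i(\theta_i) = \frac{n-1}{n}\theta_i^n.$$
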
The relationship between this model and other auctions can easily be 
seen. In this game, a candidate suffers disutility $a_i$ regardless of 
whether or not she wins. An alternative model might involve each 
agent announcing promises to pay if they win. Another example of 
this form would involve interest groups that make promises to bribe a 
committee chairman if their preferred nominee is conferred an appointment. 
In the chapter on mechanism design we will consider auctions of 
this type and trace out the relationships between equilibria of different 
types of auctions. 
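
The guess-and-verify step can be complemented by a numerical check that the strategy $\frac{n-1}{n}\theta^n$ is a best response to itself. The Python sketch below is illustrative only: the function names, the choice $n = 3$, $\theta = 0.8$, and the grid size are assumptions introduced here. It computes a candidate's expected utility from raising $a$ when every rival uses the conjectured strategy with types drawn uniformly from [0,1], and locates the maximizer by brute force.

```python
# Sketch: numerical check of the all-pay equilibrium a(theta) = ((n-1)/n) * theta**n.
# Types are i.i.d. U[0,1] as in the text; n, theta and the grid size are illustrative.

def equilibrium_bid(theta: float, n: int) -> float:
    """Conjectured symmetric equilibrium strategy."""
    return (n - 1) / n * theta**n

def expected_utility(a: float, theta: float, n: int) -> float:
    """Payoff to a type-theta candidate raising a when n-1 rivals use equilibrium_bid."""
    b = (n - 1) / n
    # A rival of type t raises less than a iff t < (a/b)**(1/n); cap at 1 when a > b.
    p_beat_one = min(1.0, (a / b) ** (1 / n))
    return theta * p_beat_one ** (n - 1) - a

def grid_best_response(theta: float, n: int, steps: int = 100_000) -> float:
    """Brute-force maximizer of expected_utility on an even grid over [0, (n-1)/n]."""
    top = (n - 1) / n
    grid = (top * k / steps for k in range(steps + 1))
    return max(grid, key=lambda a: expected_utility(a, theta, n))

if __name__ == "__main__":
    n, theta = 3, 0.8
    print(equilibrium_bid(theta, n), grid_best_response(theta, n))
```

The two printed numbers should agree up to the grid spacing. Amounts above $\frac{n-1}{n}$ are omitted from the grid because they win with certainty and only add cost.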


8. Existence of Bayesian Nash equilibria* 


When will a Bayesian Nash equilibrium exist? In considering 
this question it is useful to note that the Bayesian normal form games 
considered are really just special cases of normal form games, in which 
each player simultaneously selects a strategy (where a strategy is a 
mapping from $\Theta_i$ into $S_i$) and the payoffs are defined as the agents’ 
expected utility over strategy profiles. An alternative statement is 
also true: a Bayesian game is equivalent to a normal form game in 
which every possible agent-type pair is a player of the normal form 
game. The requirement that a Bayesian Nash equilibrium involves 
strategies that are best responses for every possible type is equivalent 
to the requirement that in this larger normal form game every player 
(a player-type pairing) selects a best response. This observation allows 
us to apply our previous results to establish the existence of Bayesian 
Nash equilibria in mixed strategies for Bayesian normal form games 
with finite type and action spaces. 

Specifically, let us start with a Bayesian game $(N, S, \Theta, u, p)$ with 
$N$, $S$, $\Theta$ all finite sets. Without loss of generality we denote types in the 
following manner: $\Theta_i = \{\theta_i^1, ..., \theta_i^{k_i}\}$. We can define a new normal form 
game $\Gamma'$ as follows: Let $N' = \{\theta_1^1, ..., \theta_1^{k_1}, ..., \theta_n^1, ..., \theta_n^{k_n}\}$. In this normal 
form game all agents with subscript $i$ have strategy space $S_i' = S_i$. 
Let $\Theta_{-i} = \times_{j \in N \setminus i} \Theta_j$ denote the set of possible type profiles for the 
agents $N \setminus i$ in the original Bayesian game. Given a strategy profile 
$s' = (s_1^1, ..., s_i^j, ..., s_n^{k_n}) \in \times_{N'} S'$ to the game $\Gamma'$ we can identify this 
strategy with one in $\Gamma$ by letting $s_i(\theta_i = \theta_i^j) = s_i^j$. The utility to 
agent $\theta_i^j$ is then defined by using the notion of expected utility in $\Gamma$: 

$$v_i^j(s') = EU_i(s_i(\theta_i^j), s_{-i}(\cdot); \theta_i^j).$$

The new normal form game $\Gamma' = (N', S', v)$ is then well defined, which 
leads to the result. 


PROPOSITION 6.1. Given the Bayesian game $(N, S, \Theta, u, p)$ with 
$N$, $S$, $\Theta$ all finite sets, a Bayesian Nash equilibrium in mixed strategies 
exists. 


PROOF. Given Nash’s Theorem (Theorem 5.4), the finite game 
$(N', S', v)$ has a Nash equilibrium in mixed strategies. Let $\sigma_i^j$ denote 
the lottery over $S_i$ that such a mixed strategy equilibrium specifies for player $\theta_i^j$. 
Since the profile $\sigma'$ satisfies the condition for a Nash equilibrium, the 
strategy $\sigma_i(\theta_i = \theta_i^j) = \sigma_i^j$ will satisfy condition 6.1. □ 
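
The construction of the agent-type normal form can be made concrete with a small example. Everything in the sketch below is hypothetical and chosen only for illustration: the two-player game, the type labels, the prior, and the payoff function `u` do not come from the text. The code builds the player set $N'$ of player-type pairs, computes each agent's expected utility by averaging over the other players' types, and enumerates pure strategy Nash equilibria of the resulting normal form game. The proposition itself concerns mixed strategies; pure strategies are used here only because this particular example happens to have a pure equilibrium.

```python
# Sketch: agent-type normal form of a small, hypothetical Bayesian game.
# Player 1 has types "A"/"B" (prior 0.7/0.3); player 2 has a single type "t".
from itertools import product

PLAYERS = [1, 2]
TYPES = {1: ["A", "B"], 2: ["t"]}
PRIOR = {1: {"A": 0.7, "B": 0.3}, 2: {"t": 1.0}}   # types are drawn independently
ACTIONS = {1: [0, 1], 2: [0, 1]}

def u(i: int, actions: dict, types: dict) -> float:
    """Hypothetical payoffs: type A likes action 0, type B likes action 1,
    and player 2 wants to match player 1's action."""
    if i == 1:
        liked = 0 if types[1] == "A" else 1
        return 1.0 if actions[1] == liked else 0.0
    return 1.0 if actions[2] == actions[1] else 0.0

AGENTS = [(i, t) for i in PLAYERS for t in TYPES[i]]   # the player set N'

def agent_payoff(agent: tuple, profile: dict) -> float:
    """Expected utility of agent (i, t_i), averaging over the other players' types."""
    i, t_i = agent
    others = [j for j in PLAYERS if j != i]
    total = 0.0
    for combo in product(*(TYPES[j] for j in others)):
        types = {i: t_i, **dict(zip(others, combo))}
        prob = 1.0
        for j in others:
            prob *= PRIOR[j][types[j]]
        actions = {j: profile[(j, types[j])] for j in PLAYERS}
        total += prob * u(i, actions, types)
    return total

def pure_nash_of_agent_form() -> list:
    """Enumerate pure profiles of the agent-type game and keep the Nash equilibria."""
    equilibria = []
    for choice in product(*(ACTIONS[i] for i, _ in AGENTS)):
        profile = dict(zip(AGENTS, choice))
        if all(
            agent_payoff(ag, {**profile, ag: d}) <= agent_payoff(ag, profile) + 1e-12
            for ag in AGENTS
            for d in ACTIONS[ag[0]]
        ):
            equilibria.append(profile)
    return equilibria

if __name__ == "__main__":
    print(pure_nash_of_agent_form())
```

Each equilibrium profile assigns one action to every player-type pair, so reading off the entries for a fixed player gives that player's type-contingent strategy in the original Bayesian game.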


9. Exercises 


EXERCISE 6.1. Consider the jury voting game where p = 3 and 


T= 2. Characterize the set of BNE to the game. Now assume that 
instead of majority rule, a version of unanimity rule is used — if all 
agents vote to convict the defendant is convicted, if at least one agent 
votes to acquit the defendant is acquitted. Characterize the BNE to 


this game (again assuming that p = * and 7 = 2). 


EXERCISE 6.2. Consider the Jury Voting Game with a continuum 
of types. Prove equation 6.2. 


EXERCISE 6.3. Consider a version of the Palfrey-Rosenthal model 
where k contributions are required for the provision of the public good. 
However, assume that contributions are refunded if there are fewer than 
k. How does this modification affect the value of the cutpoint c? Now 
suppose that contributions in excess of k are returned randomly to the 
agents. What happens? 


CHAPTER 7 


Extensive Form Games 


Normal form representations of games are static in that all players 
choose their strategies simultaneously. However, many applications in 
political science involve players choosing strategies in sequence or in 
multiple stages. As we will see, it is possible to model these games in 
the normal form. However, it is often more convenient to model these 
games in the extensive form. 

To motivate the extensive form, consider the following application 
from international relations. Assume that there are two countries 
A and B that are involved in a dispute over territory. We assume 
that B controls the territory in question; thus the first stage of the 
game involves A’s decision about whether to initiate conflict by moving 
troops into the disputed region. After observing whether A initiates, B 
then decides whether to acquiesce and let A maintain control or escalate 
in an attempt to expel A’s army from the territory. If country B decides 
to escalate, it is successful in repelling A with probability $p$. 

We assume that, apart from the resources of the disputed region, 
the countries have national wealth of $a_0$ and $b_0$ respectively, and that the territory 
is worth an additional 4 units of national wealth to each country. A’s 
invasion of the region costs one unit as long as B does not attack. 
However, an escalation by B costs each country 6 units. The following 
table gives the payoffs from each of the possible outcomes. 


Table 7.1: Escalation Game 

If A does not initiate and B acquiesces, $(a_0, b_0 + 4)$ 
If A does not initiate and B escalates, $(a_0 - 6, b_0 - 2)$ 
If A initiates and B acquiesces, $(a_0 + 3, b_0)$ 
If A initiates and B escalates, $(a_0 - 6 + 4p, b_0 - 6 + 4p)$ 


Suppose we were to model this game as a normal form where A 
chooses whether or not to initiate and B decides whether to acquiesce 
or escalate. Then we would be ignoring the fact that B knows A’s 
choice when it makes its decision. 

A better way of representing this game is using a game tree as 
in Figure 7.1. A game tree consists of nodes representing all of the 
decisions made to date. Alternatively, we say that the nodes represent 
the histories. There is an initial node at the beginning of the game 
(representing that nothing has happened). At each node there are 
branches representing the actions available to the player who chooses 
at that node. Each of these branches leads to a node of the next 
stage. The end of the game is represented by terminal nodes which 
specify the payoffs for each play of the game. 


Insert Figure 7.1 Here 


In our war game, A makes its decision at the initial node, 
from which there are two branches corresponding to initiate 
and not initiate. At each of the subsequent nodes, B makes its choice 
between acquiesce or escalate. At each of the four terminal nodes, 
the corresponding payoffs are denoted. 

Just as a matrix is used to represent normal form games, the game 
tree is a representation of the extensive form. The elements of the 
extensive game are: 


(1) The set of agents N 

(2) A set of histories of the game $H$. The elements of $H$ correspond 
to nodes of the game tree. $H^T$ is the set of terminal 
histories. By convention, the initial node is represented as 
$H^0 = \{\emptyset\}$. 

(3) A mapping $p(h) : H \setminus H^T \to N$ assigns to each non-terminal 
history $h$ an agent who must make a decision at $h$. 

(4) A set of actions $A(h)$ that $p(h)$ may take following history $h$. 
These may involve randomizations over actions. 

(5) Information sets $I \subset H \setminus H^T$ which form a partition of the 
set of non-terminal histories. If $h \in I$, $p(h)$ is uncertain whether she is 
at node $h$ or some other node $h' \in I$. In the game above, 
each player knows the history when it is called upon to play so 
that each information set contains a single element. We call 
such situations games of complete and perfect information (or 
simply perfect information). This assumption will be relaxed 
in later sections so that players may not observe all actions 
preceding their moves and some information set $I$ contains 
multiple elements. These are games of complete but imperfect 
information (or simply imperfect information). 

(6) Payoffs $U$, a list of Bernoulli utility functions $u_i(h) : H^T \to \mathbb{R}^1$ 
for each $i \in N$. 

Thus, a finite extensive form game $\Gamma^e$ is the collection $(N, H, p(\cdot), U)$. 
In the extensive form, a strategy is a complete plan of action. Therefore, 
it specifies a feasible action for the player at every history at which 
the player might be called upon to act. A formal definition of strategy 
follows. 


DEFINITION 7.1. Given an extensive form game $\Gamma^e$, a strategy 
for player $i \in N$ is a mapping $s_i(h) : H_i \to A(h)$, where $H_i$ is the set of 
histories at which player $i$ moves. A strategy 
profile is a mapping $s(h) : H \setminus H^T \to A(h)$. 


Given this definition, we can specify the strategy sets for both players. 
Since A only moves at the initial node, its strategy set is simply 
{initiate, don’t initiate}. However, B’s strategies must be conditioned 
on each history. Thus, B must choose from the following strategies: 
(always acquiesce), (always escalate), (escalate if initiate, acquiesce otherwise), 
(acquiesce if initiate, escalate otherwise). 
Now that we have specified the strategies, it is easy to see that we 
can also represent this game as a normal form. 


Table 7.2: Escalation Game in Normal Form 
B\A                                         | Initiate                      | Don’t Initiate 
(always acquiesce)                          | $a_0 + 3, b_0$                | $a_0, b_0 + 4$ 
(always escalate)                           | $a_0 - 6 + 4p, b_0 - 6 + 4p$  | $a_0 - 6, b_0 - 2$ 
(escalate if initiate, acquiesce otherwise) | $a_0 - 6 + 4p, b_0 - 6 + 4p$  | $a_0, b_0 + 4$ 
(acquiesce if initiate, escalate otherwise) | $a_0 + 3, b_0$                | $a_0 - 6, b_0 - 2$ 


With this representation, it is easy to see that there are three Nash 
Equilibria. The first two are the strategy profiles: 


(Initiate, (always acquiesce) ) 


and 
(Initiate, (acquiesce if initiate, escalate otherwise) ) 


Each of these predicts that A will invade the disputed region and B will 
not respond. The third Nash equilibrium is the profile 


(Don’t Initiate, (escalate if initiate, acquiesce otherwise) ) 


Of course, the third equilibrium predicts that A will be deterred from 
entering the disputed region by the threat that B will escalate. 
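
The three equilibria can be confirmed by brute force. The sketch below uses the outcome payoffs of Table 7.1; the numerical values $a_0 = b_0 = 10$ and $p = 0.5$, together with all identifiers, are illustrative assumptions introduced here. It maps each pair of strategies to an outcome and keeps the pairs from which neither country can gain by deviating unilaterally.

```python
# Sketch: pure strategy Nash equilibria of the escalation game in normal form.
# Outcome payoffs follow Table 7.1; A0, B0 and P are illustrative values only.
from itertools import product

A0, B0, P = 10.0, 10.0, 0.5

def outcome(initiate: bool, escalate: bool) -> tuple:
    """(payoff to A, payoff to B) for each of the four outcomes."""
    if not initiate and not escalate:
        return (A0, B0 + 4)
    if not initiate and escalate:
        return (A0 - 6, B0 - 2)
    if initiate and not escalate:
        return (A0 + 3, B0)
    return (A0 - 6 + 4 * P, B0 - 6 + 4 * P)

A_STRATEGIES = {"Initiate": True, "Don't Initiate": False}
# B's plan: (escalate if A initiates?, escalate if A does not initiate?)
B_STRATEGIES = {
    "always acquiesce": (False, False),
    "always escalate": (True, True),
    "escalate if initiate, acquiesce otherwise": (True, False),
    "acquiesce if initiate, escalate otherwise": (False, True),
}

def payoffs(sa: str, sb: str) -> tuple:
    initiate = A_STRATEGIES[sa]
    after_initiate, after_no_initiate = B_STRATEGIES[sb]
    return outcome(initiate, after_initiate if initiate else after_no_initiate)

def pure_nash() -> list:
    """Strategy pairs from which no unilateral deviation is strictly profitable."""
    equilibria = []
    for sa, sb in product(A_STRATEGIES, B_STRATEGIES):
        ua, ub = payoffs(sa, sb)
        a_ok = all(payoffs(d, sb)[0] <= ua for d in A_STRATEGIES)
        b_ok = all(payoffs(sa, d)[1] <= ub for d in B_STRATEGIES)
        if a_ok and b_ok:
            equilibria.append((sa, sb))
    return equilibria

if __name__ == "__main__":
    for eq in pure_nash():
        print(eq)
```

Note that B's deviations are weakly, not strictly, unprofitable in these equilibria, which is why strategies that prescribe different behavior off the path of play can both be best responses.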

This example shows some of the limitations of the Nash equilibrium 
concept in dynamic games. In particular, the predictions in the 
second and third equilibria are somewhat implausible. First, consider 
the second equilibrium. Suppose that Country A defected from 
its equilibrium strategy and decided not to initiate. The equilibrium 
then calls for B to escalate the conflict even though A did not initiate. 
Clearly, B is worse off by carrying out its strategy. Thus, Nash equilibrium 
allows for behavior that is not rational at histories that are “off 
the equilibrium path.” One might discount the problem in the second 
equilibrium since it predicts the same behavior as equilibrium 1, which 
does not suffer from the problem: B is happy to acquiesce if A does 
not initiate. However, consider how the problem emerges in the third 
equilibrium. Again suppose that A defected by choosing to initiate. 
If this happens B is clearly better off acquiescing for any value of $p$. 
Thus, B’s threat to escalate is not credible. It would never rationally 
carry it out if it were called on to do so. Thus, the “peaceful” outcome 
is built on behavior which is not sequentially rational, i.e., rational at 
every possible information set. 

In the next couple of sections, we discuss refinements of Nash equilibria 
appropriate for dynamic games which eliminate strategies that 
are not sequentially rational. First we discuss the concept of backward 
induction, which eliminates non-credible threats and sequentially 
irrational behavior in games of perfect information. Then we will 
introduce games of imperfect information and the refinement of subgame 
perfection. 


1. Backward Induction 


The most common way of solving dynamic games of perfect infor- 
mation is through backward induction. In backward induction, we 
assume that the last player to act chooses the action at each node 
that maximizes her utility. The second to last player then chooses 
his actions optimally knowing that the last player will choose optimal 
actions at each node. This process is continued until each player has 
chosen optimally under the assumption that all future players will make 
optimal choices at each history. 

It is easy to apply backward induction to our conflict game. First, 
we require that B make optimal choices at each node. At the initiate 
node, B clearly gets a higher utility from acquiescing. The same is true 
at the don’t initiate node so that A knows that B will always acquiesce. 
Given this knowledge, A optimally chooses to initiate. Thus, the 
solution from backward induction is (Initiate, (always acquiesce)), the 
Nash equilibrium that did not involve sequentially irrational behavior. 

Let’s consider some other examples before formalizing the proce- 
dure. 


1.1. Application: The Centipede Game. Figure 7.2 presents 
a game tree that has been studied extensively in experimental economics, 
known as the Centipede game. Two players take turns choosing 
between Down (D) and Left (L). The choice of D ends the game, but L 
continues it until stage 5. One of the reasons that this game is of such 
interest to experimentalists is that a naive player 1 may attempt to 
continue to play L in order to get the large payoff of 10 at stage 5. 
However, we will see that such a strategy is not sequentially rational and 
does not survive backward induction. We begin at stage 5 where 
player 1 will clearly play L. Backing up to stage 4, player 2 knows 
that player 1 will play L in the last stage, which would give her a payoff 
of $-10$, so she does better playing D. Backing up one more stage, 
player 1 knows that L generates $-4$ while D guarantees 3. Thus, he 
chooses D. Clearly, if we continue this process back to the first stage, 
we see that in fact player 1 will rationally choose D. Indeed, the only 
strategy profile that survives backward induction is {D, D, D, D, L}. 


Insert Figure 7.2 


1.2. Application: Sequential Bargaining. The application of 
bargaining models has become increasingly important in political game 
theory. Indeed, we dedicate an entire chapter to it later in the book. 
Here we consider one of the simplest versions. Assume that there are 
two players, 1 and 2, who are bargaining over how to allocate one dollar. In 
the first period, player 1 proposes a division of the dollar where she 
keeps $x_1$ and gives $x_2 = 1 - x_1$ to player 2. If player 2 accepts, the 
dollar is divided accordingly and the game ends. However, if player 2 
rejects, the value of the dollar decreases to $\delta$ where $1 > \delta > 0$. This 
is intended to capture the fact that the players are impatient in that 
they prefer to settle sooner rather than later. In round 2, player 2 may make 
an offer such that she keeps $x_2$ and gives $x_1 = \delta - x_2$ to player 1. If 
player 1 accepts, the remaining $\delta$ is divided. However, if she rejects, the 
dollar disappears and both players get 0. For simplicity, we assume 
that the payoffs to each player are $u_i(x_i) = x_i$. 

It turns out that there are lots of Nash equilibria to this game. In 
fact any allocation can be supported with Nash equilibrium strategies. 
To see this, consider the following strategy combination: 

Player 1: Propose $x_2 = z$. If it is rejected, reject any offer in round 
2. 

Player 2: Accept in round 1 if $x_2 \geq z$, reject otherwise and then 
propose $x_2 = \delta$ in round 2. 

Clearly, the best response of player 1 is to propose $x_2 = z$ in round 
1 for any $z \leq 1$. Otherwise, player 1 would receive 0. Similarly, 
player 2’s best response is to accept $z$. However, these strategies are 
clearly not sequentially rational. Player 1 does not profit by rejecting 
all second period proposals. She should accept any proposal that gives 
at least as high a utility as the 0 received from rejecting. Thus, player 
2 will get to keep all of $\delta$ in round 2, and so she will accept in round 1 
if and only if $x_2 \geq \delta$. Knowing this, player 1’s best proposal in round 1 
is $x_1 = 1 - \delta$ and $x_2 = \delta$. This outcome is the only Nash equilibrium 
that survives backward induction. 


1.3. Solving Games via Backward Induction. Having seen a few examples, 
we can generalize the notion of backward induction. Let $H^{T-1}$ 
be the set of histories of play that can be reached at stage $T-1$. At 
each of these histories $h \in H^{T-1}$, backward induction requires the player who 
acts, $p(h)$, to choose her action optimally to maximize her utility. Thus, 
for each $h \in H^{T-1}$, $p(h)$ selects $a^*(h) = \arg\max_{a \in A(h)} u_{p(h)}((h, a))$. 

Next consider the set of histories that immediately precede $H^{T-1}$ and 
only lead to histories in $H^{T-1}$ (we denote this set by $H^{T-2}$). For each 
$h \in H^{T-2}$, $p(h)$ selects the action $a \in A(h)$ which is optimal for $p(h)$ 
given the choices made from $H^{T-1}$, or $a^*(h) = \arg\max_{a \in A(h)} u_{p(h)}((h, a, a^*(h, a)))$. 
This process can be iterated to stage $k$ where we solve for 
$a^*(h) = \arg\max_{a \in A(h)} u_{p(h)}((h, a, a^*(h, a)))$ for each $h \in H^{T-k}$. This process 
continues to the initial node $H^0 = \{\emptyset\}$. 
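
The recursive procedure translates directly into code. The sketch below is one possible implementation under assumptions made here: a game tree is stored as nested dictionaries, ties are broken in favor of the first action listed, and the escalation game is instantiated with the payoffs of Table 7.1 and the illustrative values $a_0 = b_0 = 10$, $p = 0.5$. None of these representational choices come from the text.

```python
# Sketch: backward induction on a finite game tree of perfect information.
# Tree format (an assumption of this example): a decision node is
# {"player": index, "actions": {label: subtree}}; a terminal node is {"payoffs": tuple}.

def backward_induction(node: dict, history: tuple = ()) -> tuple:
    """Return (payoffs, plan); plan maps each decision history to the chosen action."""
    if "payoffs" in node:
        return node["payoffs"], {}
    mover = node["player"]
    best_action, best_payoffs, plan = None, None, {}
    for action, child in node["actions"].items():
        payoffs, sub_plan = backward_induction(child, history + (action,))
        plan.update(sub_plan)                      # keep choices at every later history
        if best_payoffs is None or payoffs[mover] > best_payoffs[mover]:
            best_action, best_payoffs = action, payoffs
    plan[history] = best_action
    return best_payoffs, plan

A0, B0, P = 10.0, 10.0, 0.5   # illustrative parameter values, not from the text

def leaf(*payoffs: float) -> dict:
    return {"payoffs": payoffs}

ESCALATION = {
    "player": 0,  # country A moves first
    "actions": {
        "initiate": {"player": 1, "actions": {
            "acquiesce": leaf(A0 + 3, B0),
            "escalate": leaf(A0 - 6 + 4 * P, B0 - 6 + 4 * P)}},
        "don't initiate": {"player": 1, "actions": {
            "acquiesce": leaf(A0, B0 + 4),
            "escalate": leaf(A0 - 6, B0 - 2)}},
    },
}

if __name__ == "__main__":
    payoffs, plan = backward_induction(ESCALATION)
    print(payoffs)
    for history, action in plan.items():
        print(history, "->", action)
```

The returned plan specifies an action at every decision history, including those that are never reached, which is exactly what distinguishes a strategy from a mere path of play.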


2. Dynamic Games of Complete but Imperfect Information 


So far we have only considered models in which at every stage, the 
player who moves knows all of the previous moves and so knows exactly 
which game node she is at. Using the terminology of the last section, 
all information sets contain a single element. Now we consider models 
in which information sets contain multiple histories. Games of this 
form are said to have imperfect information. This can occur either 
because not all moves are observed or because some moves are taken 
simultaneously. 

Let’s first look at a simple game where some actions may not be 
observable. Consider a game between a bureaucrat B and a politician 
P. The bureaucrat has to choose a regulatory enforcement level from 
{H, L}, which represent high and low respectively. High enforcement 
costs B an amount $c > 0$, but low enforcement is assumed to be costless. 
To keep things simple, we will assume that B gets no utility from its 
enforcement. Therefore, it gets $-c$ for H and 0 for L. However, 
assume that P prefers H to L and that $u_P(H) = 1$ and $u_P(L) = 0$. P 
cannot observe B’s enforcement level unless it conducts oversight of B 
at a cost $1 > k > 0$. If B is found to have chosen the lax enforcement, 
it suffers a penalty $f$ which we assume is greater than $c$ and is forced 
to choose H. 

The main difference between this game and ones we have seen before 
is that P does not know whether the history is H or L at the point at 
which she has to decide whether to conduct oversight. Consequently, 
$I(H) = \{H, L\}$ and $I(L) = \{H, L\}$. In the extensive form given in Figure 
7.3, we indicate that H and L are in the same information set by connecting 
the nodes with dotted lines. 


Insert Figure 7.3 Here 


Since P does not observe B’s action, she must play the same action 
at each node. Since a strategy by B is simply a choice at the first 
node, we can write this game in the normal form. 


Table 7.3: Oversight Game 
B\P | Oversight | No Oversight 


Given the assumption that $f > c$, there are no pure strategy Nash 
equilibria in this game. If B chooses H, P’s best response is to not conduct 
oversight, but the best response to no oversight is low enforcement, 
in which case P would prefer oversight. The mixed strategy equilibrium 
of this game involves B choosing the high enforcement level with 
probability $1 - k$ and P conducting oversight with probability $\frac{c}{f}$. 
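
The mixing probabilities follow from the usual indifference conditions, which the sketch below computes for a generic two-by-two game. The payoff matrix is an assumption made here from the verbal description: B pays $c$ for H, pays only the penalty $f$ when caught choosing L, and P receives 1 whenever high enforcement ultimately results, minus $k$ if she conducts oversight. The function names and the sample values of $c$, $f$, and $k$ are likewise illustrative.

```python
# Sketch: mixed strategy equilibrium of the oversight game via indifference conditions.
# The payoff entries are an assumed reading of the verbal description of the game.

def oversight_payoffs(c: float, f: float, k: float) -> dict:
    """(payoff to B, payoff to P) for each (enforcement, oversight) pair."""
    return {
        ("H", "O"): (-c, 1 - k),
        ("H", "N"): (-c, 1.0),
        ("L", "O"): (-f, 1 - k),   # caught: B pays penalty f and H is imposed
        ("L", "N"): (0.0, 0.0),
    }

def mixed_equilibrium(c: float, f: float, k: float) -> tuple:
    """Return (q, r): q = Pr(B chooses H), r = Pr(P conducts oversight)."""
    u = oversight_payoffs(c, f, k)
    # r makes B indifferent between H and L.
    gain_l_without = u[("L", "N")][0] - u[("H", "N")][0]   # B's gain from L if unobserved
    loss_l_with = u[("H", "O")][0] - u[("L", "O")][0]      # B's loss from L if observed
    r = gain_l_without / (gain_l_without + loss_l_with)
    # q makes P indifferent between oversight and no oversight.
    gain_o_vs_l = u[("L", "O")][1] - u[("L", "N")][1]      # P's gain from oversight if L
    loss_o_vs_h = u[("H", "N")][1] - u[("H", "O")][1]      # P's loss from oversight if H
    q = gain_o_vs_l / (gain_o_vs_l + loss_o_vs_h)
    return q, r

if __name__ == "__main__":
    print(mixed_equilibrium(c=1.0, f=4.0, k=0.25))
```

Under these assumed payoffs the formulas reduce to $q = 1 - k$ and $r = c/f$, so the requirement $f > c$ is what keeps the oversight probability below one.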

Now we turn to a familiar example to illustrate how the extensive 
form can accommodate simultaneous actions. The trick is to treat 
simultaneous moves as sequential ones in which subsequent players do 
not observe the action taken. Consider the prisoner’s dilemma where 
two crooks have to decide whether or not to confess. We can model 
this in extensive form by letting player one move first and then placing 
both confess and don’t confess in the same information set for player 
2 as we have done in Figure 7.4. 


Insert Figure 7.4 Here 


Finally, to show how flexible the extensive form can be, consider the 
abstract game with three stages in Figure 7.5. Each player has three 
moves: Left, Middle, Right. A move is observed by later players only when 
player 1 plays Left or when player 2 plays Right. Player 2 therefore has two 
information sets, {L} and {M, R}. Player 3 has four: {LL, LM}, {LR}, 
{ML, MM, RL, RM}, and {MR, RR}. 

Insert Figure 7.5 Here 


The technical and conceptual difficulty with games of imperfect 
information is that we can no longer apply backward induction since 
players do not know which node they are on. We need a more general 
notion of sequential rationality. Rather than assume that each player 
takes the best action given the node they are on, we assume that at 
each stage that can be conceptualized as a distinct game all the players 
play Nash equilibrium strategies. This concept is based on the idea of 
the subgame. A subgame is a subset of an extensive form that satisfies 
the following criteria: 


(1) It begins at a node that is a singleton information set. 

(2) It includes all nodes following this initial node, but only nodes 
that follow the initial node. 

(3) It does not cut any information sets. If histories $h$ and $h'$ 
are in the same information set, they are part of the same 
subgame. 


The example in Figure 7.5 has three subgames: the original game, 
a subgame following L, and a subgame following the history LR. 
A formal definition of the set of subgames follows. 


DEFINITION 7.2. Given an extensive form game $\Gamma^e$, the set of subgames 
consists of all of the extensive form games constructed by selecting all 
$h \in H \setminus H^T$ which are singleton information sets and restricting $H$, $p(\cdot)$ 
and $u_i(\cdot)$ to histories that can be reached from $h$. 


Given the definition of subgames, the new requirement of sequential 
rationality is that all agents play Nash equilibria in all subgames. This 
requirement is known as subgame perfect Nash equilibrium or SPNE. 


DEFINITION 7.3. Given an extensive form game $\Gamma^e$, a strategy profile 
$s(\cdot)$ is a subgame perfect Nash equilibrium (SPNE) if in every subgame 
of $\Gamma^e$ the restriction of the strategy profile $s(\cdot)$ to the subgame is 
a NE of the subgame. 


An important result establishes the existence of SPNE for finite 
games. 


THEOREM 7.1. Every finite extensive form game has a SPNE. More- 
over, if no player is indifferent between any two terminal histories then 
the SPNE is unique. 


As an example we will consider a problem of sequential voting by 3 
players N = {1,2,3}. Suppose that the choices x, y, z are to be voted 
on with the agenda: choose between x and y first, and then compare 
the winner with z, enacting either the winner from the first vote or z 
depending on which proposal gets the most votes. We assume that at 
each stage of voting ballots are cast simultaneously. Figure 7.6 depicts 
the game tree. 


Insert Figure 7.6 Here 


We assume that the players have the following preferences over the 
enacted policy: $x P_1 y P_1 z$; $y P_2 z P_2 x$; $z P_3 x P_3 y$. Applying 
subgame perfection and requiring that strategies are not weakly 
dominated (so voting in the final stage is sincere), we see that if the 
final vote is between x and z then players 2 and 3 will vote for z. In 
contrast, if the final vote is between y and z then players 1 and 2 will 
vote for y. Accordingly, in voting over x and y in the first period, 
strategic agents will anticipate that the real choice is between the 
sophisticated equivalents, z and y. Players 1 and 2 will therefore vote 
for y over x. Note that player 1 prefers x to y, but in a SPNE she casts 
a strategic vote for y over x because she realizes that a vote for x is 
really a vote for z, which she finds very unappealing. 
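
The backward induction argument above can be checked mechanically. The following Python sketch is only illustrative: it encodes the three preference orderings from the text, assumes sincere voting in the final stage, and lets first-stage voters compare the sophisticated equivalents. The helper names (`prefers`, `majority_winner`, `sophisticated_outcome`) are hypothetical and not part of the original exposition.

```python
# Preference orderings from the text, listed best to worst for each player.
prefs = {
    1: ["x", "y", "z"],
    2: ["y", "z", "x"],
    3: ["z", "x", "y"],
}


def prefers(voter, a, b):
    """True if `voter` ranks alternative a above alternative b."""
    ranking = prefs[voter]
    return ranking.index(a) < ranking.index(b)


def majority_winner(a, b):
    """Winner of a pairwise vote in which every voter votes sincerely."""
    votes_for_a = sum(prefers(v, a, b) for v in prefs)
    return a if votes_for_a > len(prefs) / 2 else b


def sophisticated_outcome(first_pair, last):
    """Backward induction: `first_pair` is voted on, the winner then faces `last`."""
    a, b = first_pair
    # Final stage: undominated voting is sincere, so each first-stage
    # alternative is replaced by the outcome it eventually leads to.
    equiv_a = majority_winner(a, last)
    equiv_b = majority_winner(b, last)
    # First stage: voters choose between the sophisticated equivalents.
    if equiv_a == equiv_b:
        return equiv_a, equiv_a, equiv_b
    return majority_winner(equiv_a, equiv_b), equiv_a, equiv_b


outcome, equiv_x, equiv_y = sophisticated_outcome(("x", "y"), "z")
print(outcome, equiv_x, equiv_y)
```

Under these assumptions the sketch reports y as the enacted policy, with z and y as the sophisticated equivalents of x and y respectively.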

It is important to note that if we drop the requirement that voters 
do not use weakly dominated strategies, the set of SPNE can be quite 
large. Recall that any unanimous vote is a Nash equilibrium in any of 
the subgames at the second stage of the agenda. Thus, a large number 
of SPNE can be constructed by specifying different Nash equilibrium 
strategies for each second stage subgame. 

As a second example, consider a model similar to one used by Wein- 
gast (1997) to explain the development of the rule of law. This game 
consists of a ruler R who can choose whether or not to expropriate 
wealth x from one of two social groups A or B. After observing which 
group the ruler attempts to expropriate, A and B decide simultaneously 
whether or not to challenge him. Each incurs cost c from challenging. 
If both challenge, the attempted expropriation fails and each receives 
a benefit b. A successful challenge also costs the ruler k. If one or 
zero groups challenge, the expropriation succeeds. The extensive form 
is shown in Figure 7.7. 


Insert Figure 7.7 Here 


We begin our analysis by computing the Nash equilibria of the 
subgame following the decision to expropriate from A. The normal 
form for this subgame can be represented as: 


Figure 7.4: Expropriation Subgame (payoffs listed as A, B) 

B \ A            | Challenge   | Don’t Challenge 
Challenge        | b, b        | -x, -c 
Don’t Challenge  | -x - c, 0   | -x, 0 

Clearly, there are two pure strategy Nash equilibria to this 
subgame corresponding to both groups challenging and to both groups 
not challenging. There is also a mixed strategy equilibrium but we 
will ignore it to keep the example simple. Since the subgame follow- 
ing an attempted expropriation of B is symmetric, there are also two 
pure strategy equilibria corresponding to both challenging and neither 
challenging. 

Now we can back up to the first stage of the game, where R 
anticipates the Nash equilibrium that is played in each of the subgames 
of the second stage. Suppose he anticipates that both groups will 
challenge in both subgames. Then R’s best response is not to expropriate. 
Suppose instead that the groups challenge in only one of the subgames. 
Then R will expropriate the group whose expropriation goes 
unchallenged. If there is no challenge in either subgame, the ruler might 
expropriate either group. Thus, there are five SPNE in pure strategies. 
Weingast argues that the key to establishing the rule of law is that A 
and B coordinate on the Nash equilibrium where they both challenge 
attempted expropriations by the ruler. 
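
The count of five pure strategy SPNE can be reproduced by brute force. The sketch below is a hedged illustration, not the authors' procedure: the numerical values of b, c, x, and k are arbitrary positive numbers chosen for the example, and the ruler's payoffs (0 for no attempt, x for a successful expropriation, -k for a failed one) are an assumption, since the text only states that a successful challenge costs the ruler k. It also assumes the game ends immediately if the ruler does not expropriate.

```python
from itertools import product

# Illustrative parameter values (assumptions); any positive numbers behave alike.
b, c, x, k = 2.0, 1.0, 3.0, 1.0
ACTIONS = ("C", "N")  # Challenge / Don't challenge


def group_payoffs(target, a_act, b_act):
    """Payoffs (to A, to B) in the subgame after the ruler targets `target`."""
    if a_act == "C" and b_act == "C":
        return (b, b)  # a joint challenge defeats the expropriation
    pay = {"A": 0.0, "B": 0.0}
    pay[target] -= x  # the expropriation succeeds
    if a_act == "C":
        pay["A"] -= c
    if b_act == "C":
        pay["B"] -= c
    return (pay["A"], pay["B"])


def pure_nash(target):
    """All pure strategy Nash equilibria of the challenge subgame."""
    equilibria = []
    for a_act, b_act in product(ACTIONS, repeat=2):
        ua, ub = group_payoffs(target, a_act, b_act)
        a_ok = all(group_payoffs(target, d, b_act)[0] <= ua for d in ACTIONS)
        b_ok = all(group_payoffs(target, a_act, d)[1] <= ub for d in ACTIONS)
        if a_ok and b_ok:
            equilibria.append((a_act, b_act))
    return equilibria


def ruler_payoff(choice, plan):
    """Assumed ruler payoffs: 0 if no attempt, x on success, -k on failure."""
    if choice == "none":
        return 0.0
    return -k if plan[choice] == ("C", "C") else x


spne = []
for eq_a, eq_b in product(pure_nash("A"), pure_nash("B")):
    plan = {"A": eq_a, "B": eq_b}  # equilibrium selected in each subgame
    payoffs = {r: ruler_payoff(r, plan) for r in ("none", "A", "B")}
    best = max(payoffs.values())
    spne += [(r, plan) for r, u in payoffs.items() if u == best]

print(len(spne))
```

Each element of `spne` pairs a ruler action with a selection of one Nash equilibrium per subgame, which is how the text constructs its equilibria.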

It is important to note that solution via backward induction is just 
a special case of SPNE. Since in a game of perfect information, all 
information sets are singletons, each node begins a new subgame of the 
extensive form. Clearly, optimization at every node constitutes a Nash 
equilibrium of all subgames. Therefore, any solution using backward 
induction is a SPNE. 


3. Subgame Perfection and Perfect Equilibria 


One of the justifications of SPNE as a solution concept is its close 
relationship to Selten’s perfect equilibrium concept. To see the close 
link, consider the normal form representation of our crisis game and 
the Nash equilibrium profile 


(Don’t Initiate, (escalate if initiate, acquiesce otherwise)) . 


Suppose that we computed a completely mixed equilibrium where each 
strategy had to be played with at least probability $\varepsilon$. In 
particular, assume that country A initiates with probability $\varepsilon$. Then 
country B’s expected utility from (escalate if initiate, acquiesce otherwise) 
is (1—e) (bo + 4) + € (bo —6+ 4p) while its utility from always ac- 
quiescing is (1—¢) (bo) + 4) + ebo. Since the expected utility from 
(always acquiesce) is larger for any $\varepsilon > 0$, B will want to play it 
with the maximum probability in the mixed equilibrium. As a result 
(escalate if initiate, acquiesce otherwise) cannot be the limit of 
completely mixed Nash equilibria. Thus, it not only fails the requirements 
of SPNE, but it is not a perfect equilibrium either. 

However, the two concepts do not coincide. There are perfect equilibria 
of the normal form that are not subgame perfect. This problem arises 
because extensive form games represented in the normal form often 
generate correlation in the trembles when the same player moves more 
than once in the extensive form. Conversely, a SPNE in weakly dominated 
strategies, such as the unanimous-vote equilibria of the agenda game 
above, is not perfect. 


4. Applications 


4.1. Agenda Control. 

4.1.1. The Romer-Rosenthal Model. Suppose, as is the case in many 
localities in the U.S., that local school budgets have to be approved by 
the voters. Further suppose that only the school board can place 
the measure on a referendum ballot. Thus, formally, the board has 
monopoly agenda control over proposals for school spending $s \in [0, \infty)$. 
We assume that the school board would like to maximize the amount 
of spending, so that its utility is always increasing in $s$. 

Once the referendum has been placed on the ballot, voters decide 
via majority rule whether or not to approve it. We assume that all 
voters will turn out so that their only possible strategies are {Y, N}. If 
a majority chooses Y, then s becomes the new level of spending. If 
a majority chooses N, then some reversion (or status quo) spending 
level q is adopted. We assume that voters have single peaked and 
symmetric preferences over school spending $u_i(s)$. Let $v_i$ be the ideal 
point of voter $i$. As we showed in chapter 2, such preferences take the 
form $u_i(s) = h(-|s - v_i|)$. 

We propose to solve this game using subgame perfect equilibrium. 
However, note that since the last stage of the game is a majority rule 
voting game, there are always Nash equilibria where the proposal is 
accepted and equilibria where it is rejected for any s. Thus, we will 
assume that voters do not use weakly dominated strategies in the voting 
subgame. Given any proposal s, each voter $i$ will therefore vote Y if 
$u_i(s) \geq u_i(q)$. Note that for a proposal $s > q$, single-peakedness 
implies that if voter $i$ prefers q to s, all voters with ideal points to the 
left of $i$ do so as well. Further, if voter $i$ prefers s to q, all voters to 
the right of $i$ do so as well. Thus, if the median voter votes Y then the 
proposal will pass. 

Given the equilibrium of the voting subgame, the school board’s 
best response is to choose the largest s that is acceptable to the median 
voter. Let $v_m$ be the ideal point of the median voter. Given $v_m$ 
and $q$, we can compute which policies the median prefers to $q$, that is, 
those with $u_m(s) \geq u_m(q)$. Note that since h is a non-decreasing 
function, this inequality requires that 

$$-|s - v_m| \geq -|q - v_m|.$$

Therefore, if $q < v_m$, this inequality holds for $s \in [q, 2v_m - q]$. 
Conversely, if $q > v_m$, a successful proposal requires $s \in [2v_m - q, q]$. 
Since the board wants to maximize s, it will choose 
$s^* = \max\{q, 2v_m - q\}$. Figure 7.8 plots the equilibrium value of $s^*$ 
as a function of $v_m$ and $q$. 
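
As a quick numerical check of this result, the following Python sketch computes the median voter's acceptance set and the board's spending-maximizing proposal. It is a minimal illustration under the text's assumptions (symmetric single-peaked preferences, voters who accept when indifferent); the function names `acceptance_set` and `board_proposal` are hypothetical.

```python
def acceptance_set(q, v):
    """Closed interval of proposals that a voter with ideal point v weakly prefers to q.

    With symmetric single-peaked preferences the voter accepts s exactly when
    |s - v| <= |q - v|, so the set is the interval between q and its reflection 2v - q.
    """
    lo, hi = sorted((q, 2 * v - q))
    return lo, hi


def board_proposal(q, v_m):
    """Largest spending level the median voter accepts, i.e. max{q, 2*v_m - q}."""
    return acceptance_set(q, v_m)[1]


low_reversion = board_proposal(q=2.0, v_m=5.0)   # reversion below the median's ideal point
high_reversion = board_proposal(q=6.0, v_m=5.0)  # reversion above the median's ideal point
print(low_reversion, high_reversion)
```

With a reversion of 2 and a median ideal point of 5 the board obtains 8, well above what the median voter wants, whereas with a reversion of 6 it can do no better than the reversion itself.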


Insert Figure 7.8 Here 


This simple model produces some clear predictions about the re- 
lationship between voter preferences, statutory reversions, and policy 
outcomes. First, note that the board can use its agenda control to 
generate higher spending outcomes when the statutory reversion is low 
so long as the median prefers to spend more than the reversion amount. 
This is because the voter’s threat to reject large spending proposals is 
not credible when the reversion is bad. A second important implica- 
tion is that while spending outcomes are responsive to changes in voter 
preferences (at least when $q < v_m$), spending grows twice as fast as the 
median voter’s preferred spending level. 

4.1.2. The Presidential Veto. In the United States and many other 
presidential systems, the executive has a veto power over legislative 
enactments. We can use a version of the Romer-Rosenthal model to 
explore how the veto enhances the executive’s influence over legislation. 

To keep things as simple as possible, we will model the legislature 
as a single actor L who has single-peaked symmetric preferences on a 
single dimension with an ideal point of $l$. Thus, we denote the 
legislature’s preferences as $u_l(x) = h(-|x - l|)$ for policy outcomes 
$x \in \mathbb{R}$. Extending the model to the case where the legislature is a 
collectivity is straightforward. Similarly, we assume that the president 
has an ideal point $p$ and preferences given by $u_p(x) = h(-|x - p|)$. 

The game form is very simple. In the first stage, L proposes a 
bill b to change the status quo policy q. Subsequently, the president 
P decides whether to accept b or to veto the bill, which results in 
maintaining the status quo q. Thus, we ignore the legislature’s ability 
to override vetoes, an issue we take up in the next section. 

We can solve this game very easily using backward induction. Clearly, 
in the last stage, the president’s best response is to accept any bill for 
which $u_p(b) \geq u_p(q)$ or $-|b - p| \geq -|q - p|$. Thus, if $p > q$, she 
will accept any $b \in [q, 2p - q]$. Alternatively, if $p < q$, she will accept 
$b \in [2p - q, q]$. Let $P(q)$ denote the set of bills that the president will 
accept over the status quo. Now we back up to the legislature’s 
decision node. Since the legislature knows which policies will be accepted, 
it will choose its most preferred policy from $P(q)$. If $l \in P(q)$, then 
clearly $b^* = l$. If $l < \min P(q)$, then $b^* = \min P(q)$. If 
$l > \max P(q)$, then $b^* = \max P(q)$. 

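Suppose that $l > p$. Then, given our derivations of $P(q)$, the 
equilibrium policy outcome is 

$$b^* = \begin{cases} 2p - q & \text{if } p > q \text{ and } l > 2p - q \\ l & \text{if } p > q \text{ and } l \leq 2p - q \\ l & \text{if } l \leq q \\ q & \text{if } l > q \geq p. \end{cases}$$

If $p > l$, the equilibrium outcomes are 

$$b^* = \begin{cases} 2p - q & \text{if } p < q \text{ and } l < 2p - q \\ l & \text{if } p < q \text{ and } l \geq 2p - q \\ l & \text{if } l \geq q \\ q & \text{if } l < q \leq p. \end{cases}$$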


Figure 7.9 plots the equilibrium outcomes as a function of $l$, $p$, and 
$q$. The comparative statics results are quite similar to the original 
Romer-Rosenthal model. In particular, the legislature is better off 
when the status quo is far from the president’s ideal point. Another 
important implication is that the influence conferred by the veto is not 
very large. In all of the cases in which the veto has some impact, i.e. 
$b^* \neq l$, the president is indifferent between the equilibrium proposal 
and the status quo. Finally, since the model is one of perfect information, 
the legislature perfectly predicts the president’s behavior and no vetoes 
occur in equilibrium. In later chapters, we consider models in which 
vetoes may occur as part of equilibrium strategies. 
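
The case analysis for the veto game reduces to a single operation: the legislature proposes the point of P(q) closest to its ideal point. The Python sketch below illustrates this under the text's assumptions (symmetric single-peaked preferences and a president who accepts when indifferent). The function names are hypothetical, chosen only for this example.

```python
def presidential_acceptance_set(q, p):
    """Interval P(q) of bills the president weakly prefers to the status quo q."""
    lo, hi = sorted((q, 2 * p - q))
    return lo, hi


def veto_outcome(l, p, q):
    """Equilibrium bill b*: the point of P(q) closest to the legislature's ideal l."""
    lo, hi = presidential_acceptance_set(q, p)
    return min(max(l, lo), hi)


# l > p > q: the veto binds and the president is left indifferent to q.
print(veto_outcome(l=10.0, p=5.0, q=2.0))
# l lies inside P(q): the legislature obtains its ideal point.
print(veto_outcome(l=6.0, p=5.0, q=2.0))
# l > q > p: no bill the legislature prefers to q is acceptable, so b* = q.
print(veto_outcome(l=10.0, p=5.0, q=7.0))
```

Writing the solution as a clamp onto P(q) covers both the case where the legislature's ideal point exceeds the president's and the mirror-image case in one expression.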


Insert Figure 7.9 Here 


4.1.3. The Veto Override. Now we consider a simple extension to 
the model of the previous section. Instead of assuming that q is the 
outcome following any veto, we consider a model where the legislature 
can override the veto with a supermajority. Assume that the legislature 
has n members and that $k > \frac{n+1}{2}$ votes are needed to override the 
executive veto. Further, assume that each legislator has single peaked 
preferences of the form $u_i(x) = h(-|x - l_i|)$ and that the ideal points 
$l_i$ are ordered such that $l_i > l_j$ if and only if $i > j$. Motivated by a 
model in which legislative proposals are made according to an open rule 
agenda process, we assume that the legislative proposer is the median 
with ideal point $m = l_{(n+1)/2}$. 

Given these assumptions (most importantly single-peakedness), a 
successful override requires that $u_k(b) \geq u_k(q)$ and 
$u_{n-k-1}(b) \geq u_{n-k-1}(q)$. To see that this is true, consider the case 
where $u_k(b) \geq u_k(q)$ and $u_{n-k-1}(b) < u_{n-k-1}(q)$. Since preferences 
are single-peaked, this means that there is some $i \in [n-k-1, k]$ such 
that $u_j(b) < u_j(q)$ for all legislators $j$ with ideal points lower than 
$l_i$. Therefore, the number who support the override must be strictly 
less than k. The logic of the other possibilities is similar. Because their 
support is necessary and sufficient, legislators $n-k-1$ and $k$ are 
commonly referred to as the override pivots. 

Since an override is only necessary in case of a presidential veto, only 
one of the override pivots is strategically relevant. To see this, consider 
a hypothetical vetoed bill where $u_p(b) < u_p(q)$ and $u_m(b) > u_m(q)$. If 
$p < m$, single-peakedness and the fact that $l_k > m$ imply that 
$u_k(b) > u_k(q)$. Thus, the override depends solely on $n-k-1$’s 
preferences. Similarly, if $p > m$, single-peakedness and $l_{n-k-1} < m$ 
imply that $u_{n-k-1}(b) > u_{n-k-1}(q)$. The implication is that only the 
preferences of the pivot that lies on the same side of the median as the 
president matter for a successful override. 

Thus far we have established that it is necessary for the proposer 
to attract the support of either the president or the override pivot 
on the president’s side of the median. Now we consider how the proposer 
chooses her optimal proposal. First suppose that $l_{n-k-1} < p < m$. 
Again using single-peakedness, we know that any bill that $l_{n-k-1}$ and 
$m$ prefer to $q$, $p$ also prefers. Thus, the proposer does not require the 
support of $l_{n-k-1}$. A similar argument establishes that when 
$p < l_{n-k-1} < m$, the proposer need only attract $n-k-1$’s support. The 
corresponding cases where $p > m$ are symmetric, so we know that the 
proposer need only attract the closer of the president and the override 
pivot on the president’s side of the median. 

Thus, we can define the pivotal actor and her ideal point as 
$v = \max(l_{n-k-1}, p)$ if $p < m$ and $v = \min(l_k, p)$ otherwise. 
Furthermore, the game can be treated as a direct application of the 
Romer-Rosenthal model where the ideal point of the veto player is $v$. 
Thus, the SPNE proposal is given by: 

$$b^* = \begin{cases} 2v - q & \text{if } v > q \text{ and } m > 2v - q \\ m & \text{if } v > q \text{ and } m \leq 2v - q \\ m & \text{if } m \leq q \\ q & \text{if } m > q \geq v \end{cases}$$

if $m > v$, and 

$$b^* = \begin{cases} 2v - q & \text{if } v < q \text{ and } m < 2v - q \\ m & \text{if } v < q \text{ and } m \geq 2v - q \\ m & \text{if } m \geq q \\ q & \text{if } m < q \leq v \end{cases}$$

otherwise. Figure 7.10 illustrates the equilibrium outcomes for different 
values of k. Not surprisingly, when the number of votes needed to 
override goes down, the effect of the veto power is diminished. 
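
The reduction of the override game to a single veto player can be sketched in a few lines of Python. This is an illustrative sketch only. To stay agnostic about how the pivots are indexed, it takes the ideal points of the two override pivots as inputs (`left_pivot` below the median, `right_pivot` above it); these argument names, like the function names, are hypothetical. The binding veto player is taken to be whichever of the president and the override pivot on the president's side lies closer to the median.

```python
def pivotal_ideal_point(p, m, left_pivot, right_pivot):
    """Ideal point v of the binding veto player.

    Assumes left_pivot <= m <= right_pivot. On the president's side of the
    median, the closer of the president and the override pivot is pivotal.
    """
    if p < m:
        return max(left_pivot, p)
    return min(right_pivot, p)


def override_outcome(m, p, q, left_pivot, right_pivot):
    """SPNE proposal of the median proposer m facing veto player v."""
    v = pivotal_ideal_point(p, m, left_pivot, right_pivot)
    lo, hi = sorted((q, 2 * v - q))  # bills that v weakly prefers to q
    return min(max(m, lo), hi)


# President far to the left of the median; the left override pivot is more moderate.
with_override = override_outcome(m=5.0, p=1.0, q=2.0, left_pivot=3.0, right_pivot=7.0)
# Same situation without an override: the president alone is the veto player.
without_override = override_outcome(m=5.0, p=1.0, q=2.0, left_pivot=1.0, right_pivot=9.0)
print(with_override, without_override)
```

In this example the override provision lets the proposer move policy from 2 to 4, whereas without it the president blocks any move toward the median.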


Insert Figure 7.10 Here 


4.1.4. Structure Induced Equilibrium. Recall from chapter 2 that 
the core of multi-dimensional majority rule voting models is typically 
empty. Primarily to explain how legislatures overcame this “chaos” 
problem, Shepsle (1979) developed the idea of “structure-induced equi- 
librium.” The basic idea is that legislative institutions such as commit- 
tee systems restrict the types of legislative proposals and amendments 
that may be considered, and that such restrictions lead to non-empty 
legislative cores. 

While Shepsle’s initial work is developed within the paradigm of 
social choice, we present his various models as extensive form games 
which we solve for subgame perfect Nash equilibria. 

To keep things simple, we focus on a legislature with n members 
and two committees, $C_1$ and $C_2$. The policy space is assumed to be a 
subset of $\mathbb{R}^2$. Player $C_1$ represents a committee with jurisdiction 
over dimension 1 while $C_2$ has jurisdiction over dimension 2. In the 
absence of legislative activity, we assume that the status quo 
$q = (q^1, q^2)$ remains in force. Each legislator $i$ has quadratic 
preferences over policies $(x^1, x^2)$ given by 

$$u_i(x^1, x^2) = -(x^1 - l_i^1)^2 - (x^2 - l_i^2)^2$$

where $(l_i^1, l_i^2)$ is the ideal point of legislator $i$. We also assume that 
each committee has quadratic preferences with ideal points $(c_1^1, c_1^2)$ 
and $(c_2^1, c_2^2)$ respectively. An important feature of these preferences is 
that they are separable across dimensions. Each player’s preferences 
on dimension 2 are independent of outcomes on dimension 1 and vice 
versa. 

We now consider several extensive forms representing various 
legislative institutions. 

The Open Rule with Germaneness. The first extensive form we 
consider is the open rule with a germaneness requirement. In this game, 
each committee sequentially reports a bill to change the status quo on 
its dimension. Thus, $C_1$ makes a proposal $b_1$ to change $q^1$ and $C_2$ 
makes a proposal $b_2$ to change $q^2$. Each bill may be amended, but only 
on the germane dimension. Thus, amendments to $b_j$ can only move it 
along dimension $j$. 

Let’s begin in the last stage of the game where $C_2$ proposes $b_2$. Since 
amendments to move the bill along dimension 2 can be freely made, 
the median voter theorem suggests that the final outcome will be the 
ideal point of the median voter on dimension 2. We denote this ideal 
point as $m^2 = \text{median}\{l_i^2\}$. Thus, $C_2$ has a weakly dominant 
strategy to propose $b_2 = m^2$. Note that this result depends on the fact 
that preferences are separable so that the outcome on dimension 1 does 
not affect dimension 2 preferences. Clearly, by the same logic, $C_1$ will 
propose $b_1 = m^1 = \text{median}\{l_i^1\}$. Finally, note that the separability 
of preferences implies that the order in which the committees make 
proposals does not matter. 

The policy $(m^1, m^2)$ (or the dimension by dimension median) is the 
structure induced equilibrium under the open rule with germaneness. 
It is precisely because amendments can only be made on one dimension 
at a time that we get this generalization of the median voter result 
rather than a majority rule cycle. 

Gatekeeping. While in the last section, we assumed that each 
committee had to make a proposal, now we assume that committees have 
discretion over whether to make a proposal at all. If committee $j$ 
“keeps the gates closed” on dimension $j$, the policy outcome remains 
$q^j$. However, if the committee does report a bill it can be freely 
amended subject to germaneness. 

First consider committee 2’s decision. If it opens the gates, the 
policy on its dimension will be $m^2$. Thus, it exercises gatekeeping if 
$-(b_1^* - c_2^1)^2 - (q^2 - c_2^2)^2 > -(b_1^* - c_2^1)^2 - (m^2 - c_2^2)^2$, where 
$b_1^*$ is the outcome on dimension 1. Simple algebra reveals that the 
committee will close the gates if and only if $c_2^2 > q^2 > m^2$ or 
$m^2 > q^2 > c_2^2$. When one of these conditions holds, there are no policy 
revisions to the status quo on dimension 2 that the committee and the 
2nd dimension median simultaneously prefer. Thus, the committee 
reports no bill. Since the case of committee 1 is symmetric, we obtain 
gatekeeping when $c_1^1 > q^1 > m^1$ or $m^1 > q^1 > c_1^1$. Thus, the policy 
outcomes $(x^{1*}, x^{2*})$ of the SPNE are 

$$x^{j*} = \begin{cases} q^j & \text{if } c_j^j > q^j > m^j \text{ or } m^j > q^j > c_j^j \\ m^j & \text{otherwise.} \end{cases}$$
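
To make the open rule and gatekeeping outcomes concrete, the following Python sketch computes the dimension-by-dimension median and then applies the gatekeeping comparison on each dimension. It is an illustration under stated assumptions: an odd number of legislators, separable quadratic preferences, and a committee that keeps the gates closed exactly when it strictly prefers the status quo on its own dimension to the median on that dimension (the utility comparison in the text, with the common other-dimension term cancelled). The function names and the example ideal points are hypothetical.

```python
from statistics import median


def dimension_medians(ideal_points):
    """Open rule with germaneness: the median ideal point on each dimension."""
    n_dims = len(ideal_points[0])
    return tuple(median(pt[j] for pt in ideal_points) for j in range(n_dims))


def gatekeeping_outcome(ideal_points, committee_ideals, q):
    """Policy outcome when each committee may keep the gates closed.

    committee_ideals[j] is committee j's ideal point on its own dimension j.
    """
    m = dimension_medians(ideal_points)
    outcome = []
    for c_j, q_j, m_j in zip(committee_ideals, q, m):
        # Separability means only the own-dimension terms differ across the options.
        keeps_gates_closed = -(q_j - c_j) ** 2 > -(m_j - c_j) ** 2
        outcome.append(q_j if keeps_gates_closed else m_j)
    return tuple(outcome)


legislators = [(0, 0), (1, 3), (2, 1)]  # hypothetical ideal points
open_rule = dimension_medians(legislators)
gatekeeping = gatekeeping_outcome(legislators, committee_ideals=(3, 0), q=(2, 5))
print(open_rule, gatekeeping)
```

In the example the first committee's ideal point (3) lies on the far side of the status quo (2) from the median (1), so it keeps the gates closed, while the second committee prefers the median (1) to a distant status quo (5) and reports a bill.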

The Closed Rule. Finally, suppose that committees make proposals 
under closed rules so that amendments are not allowed. Therefore, 
they are Romer-Rosenthal agenda setters within their jurisdictions. 
We can directly apply the results of section 4.1.2 where the proposer is 
$C_j$ and the vetoer is $m^j$. Note that gatekeeping powers are irrelevant in 
this game since committee $j$ can do at least as well as the gatekeeping 
outcome by proposing $q^j$. In fact, as Groseclose and Krehbiel (2002) 
point out, both $C_j$ and the $j$th dimension median are better off under 
the closed rule than the open rule with gatekeeping. 


4.2. A Model of Power Transitions. Our next application is 
based on the work of Powell (1999), who uses a similar model to study 
how dramatic shifts of power in the international system might lead 
to violent conflict. Suppose that there are two countries A and B. 
Country A is making a claim against a region controlled by B. The 
total value of the region is normalized to $1 per period. We focus on 
a two-period version of this game and assume that each country weighs 
the outcome of each period equally. 

First consider country B’s options. It can appease A in each period 
$t$ by offering it a share of the region’s output $0 \leq x_t \leq 1$ or it can 
attempt to settle the dispute militarily by attacking A. If B chooses 
to attack and wins the war, A drops its claim and the game ends. If B 
loses the war, A takes undisputed control of the region. Country A’s 
available choices in period $t$ are either to accept $x_t$ or to refuse it and 
go to war. Fighting a war costs $c$ to both sides. 

An important feature of the environment that Powell studies is that 
country A’s military capability is increasing relative to B’s over time. 
Assume that in the first period, A wins a war with probability $p_1$ 
whereas in the second period A wins with probability $p_2 > p_1$. To keep 
things interesting, we assume that $p_2 > c$. 

With respect to the incidence of violent conflict, there are two types 
of equilibria that we might observe: one in which B appeases A in both 
periods and one in which B attacks A in the first period. 

First, suppose that $2c \geq p_2 - 2p_1$. Our claim is that in this case, 
the SPNE is one where B gives $p_2 - c$ to country A in the second 
period and $\max\{0, 2p_1 - p_2\}$ in the first period. Since this is 
a game of perfect information, we can verify that these strategies are 
part of a SPNE using backward induction. In the second period, A’s 
expected utility of fighting is $p_2 - c$ so that B must offer at least as 
much to avoid a conflict, leaving it with a payoff of $1 - p_2 + c$. Since B’s 
expected utility of fighting is $1 - p_2 - c$, it strictly prefers appeasing. 
Now consider period 1. A receives 1 in each period if it wins a war 
and 0 if it loses. Therefore, its expected utility of fighting is $2p_1 - c$. 
To appease A, B must choose $x_1$ so that $x_1 + p_2 - c \geq 2p_1 - c$ 
or $x_1 \geq 2p_1 - p_2$. Since B will rationally offer the minimum acceptable 
amount, $x_1^* = \max\{0, 2p_1 - p_2\}$. Now we need only check to see that B 
would prefer to pay $x_1^* + x_2^*$, where $x_2^* = p_2 - c$. Since B’s expected 
utility of war is $2(1 - p_1) - c$, it prefers the payment if and only if 
$2c \geq p_2 - 2p_1 + x_1^*$, which always holds when $2c \geq p_2 - 2p_1$. 

Now suppose that this condition does not hold so that $p_2 - 2p_1 > 2c$. 
Since $c > 0$, this requires that $x_1^* = 0$. Thus, B would prefer to fight 
rather than make the payments since $p_2 - 2p_1 + x_1^* > 2c$. Thus, when the 
condition fails, the SPNE has B attacking A in the first round. 
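
The two cases can be verified numerically. The Python sketch below follows the backward induction step by step under the text's assumptions (second-period win probability above the first-period one and above the cost of war); it additionally assumes that B appeases when indifferent. The function name `power_transition` and the numerical inputs are hypothetical.

```python
def power_transition(p1, p2, c):
    """Return (outcome, x1, x2) for the two-period game.

    Assumes p2 > p1 and p2 > c > 0. Offers are None when B attacks.
    """
    x2 = p2 - c                  # second-period offer that leaves A indifferent to war
    x1 = max(0.0, 2 * p1 - p2)   # smallest first-period offer that A accepts
    peace_payoff_b = (1 - x1) + (1 - x2)
    war_payoff_b = 2 * (1 - p1) - c
    if peace_payoff_b >= war_payoff_b:
        return "appease", x1, x2
    return "attack", None, None


# Slow power shift: 2c >= p2 - 2*p1, so B buys peace in both periods.
slow_shift = power_transition(p1=0.4, p2=0.5, c=0.1)
# Rapid power shift: p2 - 2*p1 > 2c, so B attacks while A is still weak.
rapid_shift = power_transition(p1=0.1, p2=0.9, c=0.2)
print(slow_shift, rapid_shift)
```

Comparing B's total payoff from peace with its payoff from an immediate war is equivalent to the condition on 2c stated in the text, so the sketch can be used to explore how the outcome changes with the speed of the power shift.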

To generate intuition, note that the necessary condition for a 
peaceful resolution only fails when $p_2$ is much greater than $p_1$, so 
that A is much weaker in the first stage than it will be in the second 
stage. Thus, B prefers to attack when A is weak to avoid making large 
concessions when A becomes more powerful. If the distribution of power 
were stable, there would be no war in equilibrium. 


4.3. A Model of Transitions to Democracy. Acemoglu and 
Robinson (forthcoming) develop a number of models designed to ex- 
plore the conditions under which authoritarian polities will adopt de- 
mocratic institutions. In this section, we provide a simple sketch of 
their framework and one of their models. 

Suppose that there are two types of agents: rich and poor. Let 
$\lambda > \frac{1}{2}$ be the proportion of citizens who are poor while 
$1 - \lambda$ is the proportion of rich citizens. Since these agents differ in 
their incomes, they have different preferences over tax rates. Rich 
citizens each receive income $y^r$ and poor citizens have income $y^p$. The 
average income in the society is $\bar{y} = \lambda y^p + (1 - \lambda) y^r$. Clearly, 
$y^r > \bar{y} > y^p$. An important parameter in Acemoglu and Robinson’s 
analysis is $\theta$, which represents the share of income held by the poor so 
that 

$$y^p = \frac{\theta \bar{y}}{\lambda} \quad \text{and} \quad y^r = \frac{(1 - \theta)\bar{y}}{1 - \lambda}.$$

Thus, an increase in $\theta$ represents a decrease in inequality. 

The primary policy instrument in this political economy is a linear 
tax and transfer scheme where the government sets a proportional tax 
rate $\tau$ and then transfers the tax revenue back to the citizens in equal 
lump sums. Given a tax rate $\tau$, the per capita tax levy is $\tau\bar{y}$. 
However, as a simple way of capturing the distortionary effects of income 
taxation, Acemoglu and Robinson assume that revenues are lower than 
the levy by a function $C(\tau)\bar{y}$. To keep things simple and get a closed 
form solution, we assume that $C(\tau) = \frac{1}{2}\tau^2$. Thus, the distortion is 
an increasing convex function of the tax rate. After deducting this 
deadweight loss, the amount of money available for transfers is given by 
$T = (\tau - C(\tau))\bar{y}$ and the after-tax and transfer income of each agent 
is 

$$V^i(\tau) = (1 - \tau)\,y^i + \left(\tau - \tfrac{1}{2}\tau^2\right)\bar{y}.$$

Now we consider the preferred tax rates by rich and poor voters. The 
first order condition for the optimal tax rate choice by poor voters is 

$$\bar{y} - y^p - \tau\bar{y} = 0.$$

Using the fact that $y^p = \theta\bar{y}/\lambda$, we can write the poor’s most 
preferred tax rate as 

$$\tau^p = \frac{\lambda - \theta}{\lambda}.$$

Since the poor have lower incomes than the rich, their income share 
is lower than their share in the population so that $\lambda > \theta$. Thus, 
$0 < \tau^p < 1$. Also note that $\tau^p$ is decreasing in $\theta$ so that the poor’s 
preferred tax rate is increasing in inequality. 

Now consider the preferences of the rich. Their first order condition 
is 

$$\bar{y} - y^r - \tau\bar{y} = 0.$$

This produces an infeasible negative tax rate. Thus, the rich’s most 
preferred feasible tax rate is $\tau^r = 0$. 
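
The preferred tax rates can be checked numerically. The Python sketch below is illustrative only: it assumes the quadratic distortion described above (one half of the squared tax rate), writes lambda for the population share of the poor and theta for their income share, and normalizes average income to one. The function names and parameter values are hypothetical.

```python
def incomes(lam, theta, y_bar=1.0):
    """Incomes of a poor and a rich citizen given population share lam and income share theta."""
    y_poor = theta * y_bar / lam
    y_rich = (1 - theta) * y_bar / (1 - lam)
    return y_poor, y_rich


def post_tax_income(tau, y_i, y_bar=1.0):
    """After-tax and transfer income with deadweight loss 0.5 * tau**2 * y_bar."""
    return (1 - tau) * y_i + (tau - 0.5 * tau ** 2) * y_bar


def preferred_tax(y_i, y_bar=1.0):
    """Solve the first order condition y_bar - y_i - tau * y_bar = 0, truncated at zero."""
    return max(0.0, (y_bar - y_i) / y_bar)


lam, theta = 0.7, 0.4
y_poor, y_rich = incomes(lam, theta)
tau_poor = preferred_tax(y_poor)
tau_rich = preferred_tax(y_rich)
print(tau_poor, tau_rich)
```

The poor's preferred rate coincides with the ratio of (lambda minus theta) to lambda, and the truncation at zero reproduces the corner solution for the rich.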

In Acemoglu and Robinson’s model, there is a political shock in 
each period which determines the consequences of overthrowing the 
regime and replacing it by a dictatorship of the left. They assume that 
when the shock is $S$, a fraction $1 - \mu^S$ of the economy’s income is 
destroyed where $S \in \{H, L\}$ and $\mu^H > \mu^L$. Thus, in state H the costs 
of overthrowing the regime are low compared to state L. During a 
revolution the income of the rich is confiscated and evenly divided among 
the poor. Thus, in state $S$ the payoff to the poor is given by 

$$V^p(R, \mu^S) = \frac{\mu^S \bar{y}}{\lambda}.$$

For simplicity, they assume that following the revolution the rich get 
no income so that $V^r(R, \mu^S) = 0$. 

To parameterize the outcomes of Acemoglu and Robinson’s model, 
we need to consider the extent to which revolution is a threat. We say 
that the revolution constraint binds in state $S$ if the poor prefer 
revolution to an authoritarian outcome at the rich’s ideal tax rate of 
zero, that is, if $V^p(R, \mu^S) > V^p(0)$. Substituting the above 
expressions, we find that this constraint binds when $\mu^S > \theta$. 

Given this specification of the economy and the costs of revolution, 
we turn to one of Acemoglu and Robinson’s extensive form games, 
illustrated in Figure 7.11. To simplify the figure, we show only the 
extensive form following the realization of the state S. 


Insert Figure 7.11 Here 


First, the state, H or L, is revealed. Then the rich move first and 
decide whether to move to a democracy D or to maintain control in an 
authoritarian system N. If the rich choose N, they also choose a tax 
rate $\tau$. 

After the rich make their decisions, the poor move next and decide 
whether to initiate a revolution R or to accept the rich’s decision (NR). 
If they revolt, the payoffs are $V^p(R, \mu^S) = \mu^S\bar{y}/\lambda$ and 
$V^r(R, \mu^S) = 0$. If they do not revolt against D, the tax rate is chosen 
by majority rule. Since the median voter is poor, the equilibrium tax 
rate is $\tau^p$ and the payoffs to D are therefore $V^r(\tau^p)$ and $V^p(\tau^p)$ 
for the rich and poor respectively. By comparing the payoffs of R and 
D, it is easy to establish that in state $S$ the poor will prefer to revolt 
rather than accept democracy if and only if 
$\mu^S > \theta + \tau^p(\lambda - \theta) - \frac{1}{2}\lambda(\tau^p)^2$ or 

$$\mu^S > \theta + \frac{(\lambda - \theta)^2}{2\lambda}.$$

Now suppose the rich chose N and the poor prefer not to revolt. 
Acemoglu and Robinson assume that the rich may not be able to 
commit to maintaining $\tau > 0$ after the revolutionary threat has passed. 
To model this commitment problem, they assume that with probability 
$p$ the rich maintain the tax rate $\tau$ but with probability $1 - p$ they have 
the opportunity to renege and choose $\tau^r = 0$. Given the rich’s initial 
choice of tax rate and the possibility of reneging, we can compute that 
the utilities from N are 

$$V^p(N, \tau) = (1 - p)\,y^p + p\left[(1 - \tau)\,y^p + \left(\tau - \tfrac{1}{2}\tau^2\right)\bar{y}\right]$$

and 

$$V^r(N, \tau) = (1 - p)\,y^r + p\left[(1 - \tau)\,y^r + \left(\tau - \tfrac{1}{2}\tau^2\right)\bar{y}\right]$$

if the poor do not revolt. Given these payoffs, it is easy to see that 
the poor will prefer to revolt against N if 

$$\mu^S > \theta + p\left[\tau(\lambda - \theta) - \tfrac{1}{2}\lambda\tau^2\right].$$

In order to reduce the number of cases, we follow Acemoglu and 
Robinson and assume $\mu^L < \theta$ so that the poor never revolt in state L. 
This leaves us with three cases: 

(1) Suppose that $\mu^H < \theta$. Then the revolution constraint binds 
in neither state. Thus, the unique SPNE consists of N, a tax 
rate of zero, and no revolution. 

(2) Suppose that $\mu^H > \theta + \frac{(\lambda - \theta)^2}{2\lambda}$. Then even democracy 
does not deter the poor from revolting, so a revolution occurs. 

(3) Suppose that $\theta + \frac{(\lambda - \theta)^2}{2\lambda} > \mu^H > \theta$. In this case, it may be 
possible for the rich to avoid a revolution by accommodating 
the poor with a tax rate $\tau$. From above we know that doing 
so requires that the rich set the tax rate so that 

$$p \geq \frac{\mu^H - \theta}{\tau(\lambda - \theta) - \frac{1}{2}\lambda\tau^2}.$$

However, if 

$$p < \frac{\mu^H - \theta}{\tau^p(\lambda - \theta) - \frac{1}{2}\lambda(\tau^p)^2} = \frac{2\lambda(\mu^H - \theta)}{(\lambda - \theta)^2},$$

the rich will prefer to choose D rather than set the tax rate higher 
than $\tau^p$. Thus, there is a critical value 
$p^* = \frac{2\lambda(\mu^H - \theta)}{(\lambda - \theta)^2}$ such that democracy 
is the outcome if $p^* > p$. Thus, when the rich have difficulty 
committing to a high tax rate, they can avoid revolution by 
transitioning to democracy. 


To generate some predictions about when democratic transitions 
are likely to occur, we can look at how $p^*$ is affected by changes in 
the parameters. Not surprisingly, $p^*$ is increasing in $\mu^H$, suggesting 
that when the costs of revolution are low, the rich are more likely to 
support democratization. Secondly, $p^*$ and the likelihood of democracy 
are decreasing in $\theta$. This occurs because greater inequality makes a 
revolution a more attractive option for the poor. In turn, the rich 
have to make more concessions to prevent it. If committing to these 
concessions is sufficiently difficult, a democratic transition will occur. 
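
The three cases and the comparative statics can be explored with a short Python sketch. It is a hedged illustration rather than Acemoglu and Robinson's own formulation: it writes lambda for the population share of the poor, theta for their income share, and mu_H for the share of income that survives a revolution in state H, and it uses the critical commitment probability p* = 2 lambda (mu_H - theta) / (lambda - theta)^2, obtained by evaluating the no-revolution condition at the poor's preferred tax rate. The function names and the numerical values are hypothetical.

```python
def critical_commitment(lam, theta, mu_h):
    """Critical probability p*: below it, no tax promise under N deters revolution."""
    return 2 * lam * (mu_h - theta) / (lam - theta) ** 2


def regime_outcome(lam, theta, mu_h, p):
    """Classify the outcome in state H; assumes 0 < theta < lam < 1 and 0 <= p <= 1."""
    democracy_threshold = theta + (lam - theta) ** 2 / (2 * lam)
    if mu_h < theta:
        return "authoritarian, zero tax"        # revolution constraint does not bind
    if mu_h > democracy_threshold:
        return "revolution"                     # even democracy does not deter revolt
    if p < critical_commitment(lam, theta, mu_h):
        return "democracy"                      # promises under N are not credible enough
    return "authoritarian, tax concession"


lam, theta = 0.7, 0.4
print(regime_outcome(lam, theta, mu_h=0.30, p=0.5))
print(regime_outcome(lam, theta, mu_h=0.60, p=0.5))
print(regime_outcome(lam, theta, mu_h=0.45, p=0.5))
print(regime_outcome(lam, theta, mu_h=0.45, p=0.9))
```

With these illustrative numbers the critical probability is roughly 0.78, so a commitment probability of 0.5 leads to democratization while 0.9 allows the rich to retain control with a tax concession.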


4.4. A Model of Coalition Formation. One of the earliest ap- 
plications of political game theory is the study of coalition formation 
(Riker 1962). While the earliest models were developed within the co- 
operative game theoretic and social choice traditions, there have been 
a number of recent applications using non-cooperative bargaining mod- 
els. 

In this section, we look at one such model of coalition governments
developed by Banks and Austen-Smith (1989). Assume that there are
three parties $\alpha$, $\beta$, and $\gamma$, where
$\Omega = \{\alpha, \beta, \gamma\}$, who have known policy positions
$p_\alpha$, $p_\beta$, and $p_\gamma$ on a one-dimensional policy space
$P \subset \mathbb{R}$ where $p_\alpha > p_\beta > p_\gamma$. Let
$w = \{w_\alpha, w_\beta, w_\gamma\}$ be the vector of vote shares for
the parties in the last election, where we assume that all vote shares
are less than $\frac{1}{2}$ so that the government has to be a coalition.
To simplify matters, we will assume that these vote shares are exogenous
parameters, whereas Banks and Austen-Smith derive them endogenously from
a model of voting. We will also be interested in the vote shares for
parties in various coalitions. Let $C \subseteq \Omega$ be a coalition;
then the vote share of each coalition is given by

$$w_C = \sum_{k \in C} w_k.$$

We say that $C$ is a winning coalition if $w_C > \frac{1}{2}$. So let
$D(w) = \{C \subseteq \Omega : w_C > \frac{1}{2}\}$ be the set of winning
coalitions and $D_k(w) = \{C \in D(w) : k \in C\}$ be the set of winning
coalitions that include party $k$.
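
The sets of winning coalitions can be enumerated directly. The following is a minimal sketch, not part of the original model: the function names and the vote shares are hypothetical and chosen only so that every party has less than half of the vote.

```python
from itertools import combinations

def winning_coalitions(w):
    """Return D(w): every coalition whose combined vote share exceeds 1/2."""
    parties = sorted(w)
    return [
        frozenset(c)
        for size in range(1, len(parties) + 1)
        for c in combinations(parties, size)
        if sum(w[k] for k in c) > 0.5
    ]

def winning_coalitions_with(w, k):
    """Return D_k(w): the winning coalitions that include party k."""
    return [c for c in winning_coalitions(w) if k in c]

# Hypothetical vote shares (an assumption for illustration), each below 1/2.
w = {"alpha": 0.40, "beta": 0.35, "gamma": 0.25}
D = winning_coalitions(w)
D_alpha = winning_coalitions_with(w, "alpha")
```

With three parties that each hold less than half of the vote, no single party is winning, so every formateur needs at least one partner.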

The three parties will bargain over the formation of a new government.
In doing so, they will choose a policy $y \in P$ and allocate a fixed
set of portfolios $G$. To keep things simple, we follow Austen-Smith and
Banks and assume that $G$ is infinitely divisible. We assume that the
allocations $g = \{g_\alpha, g_\beta, g_\gamma\}$ satisfy
$\sum_{k \in \Omega} g_k = G$.

We assume that each party has quadratic preferences over policy
and additive linear preferences in portfolios. Therefore, the payoff to
party $k$ from policy $y$ and allocation $g$ is given by

$$-(y - p_k)^2 + g_k.$$

The protocol for bargaining is as follows. First, the party with the
largest vote share, say $k$, is selected as formateur and chooses a
coalition from $D_k(w)$. The formateur then proposes a policy $y_k$ and
an allocation $g_k$. If its coalition partners accept, $y_k$ and $g_k$
are implemented and the game ends. However, if one of the coalition
partners vetoes, the second largest party, say $j$, becomes the
formateur, selects a coalition from $D_j(w)$, and proposes $y_j$ and
$g_j$. If this is defeated, the smallest party becomes the formateur.
If the smallest party is unsuccessful, a caretaker government takes
office, maintains a status quo policy $p_q$, and chooses
$g = \{0, 0, 0\}$.

We can solve this game via backward induction. The payoff
to party $k$ from a caretaker government is $v_k^q = -(p_q - p_k)^2$. To
simplify, we will assume that $v_k^q < -(p_j - p_k)^2$ for all $j$ and $k$
so that any party $k$ prefers party $j$'s ideal point and a zero share of
the portfolios to a caretaker government. Formally, this requires that
$p_q \notin [2p_\gamma - p_\alpha,\ 2p_\alpha - p_\gamma]$.

We first consider the example of vote shares such that
$w_\alpha > w_\beta > w_\gamma$. So consider the third stage where party
$\gamma$ is the formateur. By assumption, all parties prefer
$y_\gamma = p_\gamma$ and $g_\gamma = (0, 0, G)$ to the caretaker
government so this must be party $\gamma$'s optimal choice.

Now consider party $\beta$'s choice. The utilities of defeating any
proposal by party $\beta$ and moving to party $\gamma$'s proposal stage
are $v_\gamma^\gamma = G$ and $v_\alpha^\gamma = -(p_\gamma - p_\alpha)^2$.
Since $\gamma$ receives the highest possible utility from voting against
$\beta$'s offer, $\beta$ must make an offer to party $\alpha$. However,
note that $\alpha$ prefers $y_\beta = p_\beta$ and $g_\beta = (0, G, 0)$
to a government formed by $\gamma$, so $\alpha$ will accept $\beta$'s
ideal point and no portfolios.

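Now we back up to the first stage of the game where $\alpha$ makes a
proposal. The utilities from defeating party $\alpha$'s proposal and
moving to party $\beta$'s stage are now $v_\beta^\beta = G$ and
$v_\gamma^\beta = -(p_\beta - p_\gamma)^2$. Clearly, $\alpha$ has nothing
to offer $\beta$ and will thus try to form a coalition with $\gamma$.
Thus, $\alpha$ will choose $y_\alpha$ and $g_\gamma$ to maximize
$-(y_\alpha - p_\alpha)^2 + G - g_\gamma$ subject to
$-(y_\alpha - p_\gamma)^2 + g_\gamma \geq -(p_\beta - p_\gamma)^2$ and
$G \geq g_\gamma \geq 0$. There are three cases depending on whether
there are corner solutions $g_\gamma = G$ or $g_\gamma = 0$:

(1) If $p_\alpha - p_\beta \geq p_\beta - p_\gamma$ and
    $G \geq \frac{1}{4}(p_\alpha - p_\gamma)^2 - (p_\beta - p_\gamma)^2$,
    then $y_\alpha^* = \frac{p_\alpha + p_\gamma}{2}$ and
    $g_\gamma^* = \frac{1}{4}(p_\alpha - p_\gamma)^2 - (p_\beta - p_\gamma)^2$.

(2) If $p_\alpha - p_\beta \geq p_\beta - p_\gamma$ and
    $G < \frac{1}{4}(p_\alpha - p_\gamma)^2 - (p_\beta - p_\gamma)^2$,
    then $y_\alpha^* = p_\gamma + \sqrt{G + (p_\beta - p_\gamma)^2}$ and
    $g_\gamma^* = G$.

(3) If $p_\alpha - p_\beta < p_\beta - p_\gamma$, then
    $y_\alpha^* = p_\beta$ and $g_\gamma^* = 0$.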


In the first two cases, the distance from $\alpha$'s ideal point to
$p_\beta$ is greater than the distance from $p_\beta$ to $\gamma$'s ideal
point. Thus, $\alpha$ is more willing to give up portfolios in favor of
a policy better than $p_\beta$ than party $\gamma$ requires as
compensation, and party $\alpha$ offers a compromise policy. When $G$ is
sufficiently large, $\alpha$ offers the compromise policy
$y = \frac{p_\alpha + p_\gamma}{2}$, which reflects an optimal trade-off
between its policy goals and its desire to hold portfolios. When $G$ is
small, however, $\alpha$ is willing to give up all of the portfolios in
order to move policy in the direction of its ideal point. Finally, in
the last case, $\alpha$ is sufficiently well off under $p_\beta$
compared to $\gamma$ that $\alpha$ is unwilling to compensate $\gamma$
for moving policy towards its ideal point. An interesting feature of
this outcome is that the coalition is a non-connected one of the extreme
parties. This is in contrast to arguments stressing that policy
motivated parties will seek to form coalitions with ideological allies
(Axelrod 1970).
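
The three cases can be checked numerically. The sketch below is only an illustration: the function names, the grid search, and the parameter values are our own assumptions, and the grid search simply confirms the closed-form solution to $\alpha$'s constrained maximization problem when $w_\alpha > w_\beta > w_\gamma$.

```python
import math

def alpha_offer_to_gamma(p_a, p_b, p_g, G):
    """Closed-form (policy, portfolios given to gamma) proposed by alpha."""
    interior_g = 0.25 * (p_a - p_g) ** 2 - (p_b - p_g) ** 2
    if p_a - p_b >= p_b - p_g:
        if G >= interior_g:
            return (p_a + p_g) / 2, interior_g            # case 1: interior
        return p_g + math.sqrt(G + (p_b - p_g) ** 2), G   # case 2: g = G
    return p_b, 0.0                                       # case 3: g = 0

def brute_force(p_a, p_b, p_g, G, steps=2000):
    """Grid search over policies between p_g and p_a, paying gamma the
    smallest transfer that makes it accept."""
    best = None
    for i in range(steps + 1):
        y = p_g + (p_a - p_g) * i / steps
        g = max(0.0, (y - p_g) ** 2 - (p_b - p_g) ** 2)
        if g > G:
            continue  # alpha cannot afford this policy
        value = -(y - p_a) ** 2 + G - g
        if best is None or value > best[0]:
            best = (value, y, g)
    return best[1], best[2]

# Hypothetical ideal points and budgets, one per case.
examples = [
    (1.0, 0.25, 0.0, 1.0),   # case 1
    (1.0, 0.25, 0.0, 0.1),   # case 2
    (1.0, 0.60, 0.0, 1.0),   # case 3
]
```

Calling `alpha_offer_to_gamma(*args)` and `brute_force(*args)` on each tuple in `examples` should give matching answers up to the grid resolution.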

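Now consider the case where $w_\beta > w_\alpha > w_\gamma$. Once again
$\gamma$ will choose $y_\gamma = p_\gamma$ and $g_\gamma = (0, 0, G)$ in
the last period. Now consider $\alpha$'s choice in the second period.
Clearly, it has nothing it can offer $\gamma$ and will therefore try to
build a coalition with $\beta$. Thus, $\alpha$ will choose $y_\alpha$
and $g_\beta$ to maximize $-(y_\alpha - p_\alpha)^2 + G - g_\beta$
subject to
$-(y_\alpha - p_\beta)^2 + g_\beta \geq -(p_\beta - p_\gamma)^2$ and
$G \geq g_\beta \geq 0$. The solution has four distinct cases.

(1) If $\frac{p_\alpha + p_\beta}{2} \geq 2p_\beta - p_\gamma$ and
    $G \geq \frac{1}{4}(p_\alpha - p_\beta)^2 - (p_\beta - p_\gamma)^2$,
    then $y_\alpha^* = \frac{p_\alpha + p_\beta}{2}$ and
    $g_\beta^* = \frac{1}{4}(p_\alpha - p_\beta)^2 - (p_\beta - p_\gamma)^2$.

(2) If $\frac{p_\alpha + p_\beta}{2} \geq 2p_\beta - p_\gamma$ and
    $G < \frac{1}{4}(p_\alpha - p_\beta)^2 - (p_\beta - p_\gamma)^2$,
    then $y_\alpha^* = p_\beta + \sqrt{G + (p_\beta - p_\gamma)^2}$ and
    $g_\beta^* = G$.

(3) If $p_\alpha \geq 2p_\beta - p_\gamma > \frac{p_\alpha + p_\beta}{2}$,
    then $y_\alpha^* = 2p_\beta - p_\gamma$ and $g_\beta^* = 0$.

(4) If $p_\alpha < 2p_\beta - p_\gamma$, then $y_\alpha^* = p_\alpha$
    and $g_\beta^* = 0$.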

In the last two cases, $\alpha$ finds it optimal to play the
Romer-Rosenthal setter game with $\beta$ with a reversion of $p_\gamma$
over policy and offer no portfolios. In cases 1 and 2, party $\alpha$
makes portfolio concessions to move policy further than the
Romer-Rosenthal inflection point of $2p_\beta - p_\gamma$.
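
As a rough check on the four cases, the following sketch codes the closed-form solution for $\alpha$'s offer to $\beta$. The function name and the numerical ideal points are hypothetical; only the case conditions and formulas come from the derivation above.

```python
import math

def alpha_offer_to_beta(p_a, p_b, p_g, G):
    """Closed-form (policy, portfolios given to beta) proposed by alpha
    when beta's fallback is gamma's ideal point with no portfolios."""
    setter_point = 2 * p_b - p_g          # furthest policy beta accepts for free
    midpoint = (p_a + p_b) / 2
    interior_g = 0.25 * (p_a - p_b) ** 2 - (p_b - p_g) ** 2
    if midpoint >= setter_point:
        if G >= interior_g:
            return midpoint, interior_g                       # case 1
        return p_b + math.sqrt(G + (p_b - p_g) ** 2), G       # case 2
    if p_a >= setter_point:
        return setter_point, 0.0                              # case 3
    return p_a, 0.0                                           # case 4

# Hypothetical parameter values, one per case.
case1 = alpha_offer_to_beta(1.0, 0.1, 0.0, 1.0)
case2 = alpha_offer_to_beta(1.0, 0.1, 0.0, 0.05)
case3 = alpha_offer_to_beta(1.0, 0.4, 0.0, 1.0)
case4 = alpha_offer_to_beta(1.0, 0.6, 0.0, 1.0)
```

In each example the proposed policy lies strictly above $p_\beta$, which is the property used in the next step of the backward induction.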

Despite the complexity of these cases, the important thing to note
is that all predict $y_\alpha^* > p_\beta$ and $g_\gamma^* = 0$. Thus,
when party $\beta$ makes its offer in the first period, it knows that
party $\gamma$ will accept $y_\beta^* = p_\beta$ and $g_\gamma^* = 0$.
Thus, the subgame perfect Nash equilibrium outcomes are $y^* = p_\beta$
and $g^* = (0, G, 0)$. Thus, this case generates a connected
coalition that implements the ideal point of the median party.

We leave proofs for the remaining cases as exercises. However, the 
following table summarizes the outcomes for all cases. 


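Table 7.5: Outcomes of Austen-Smith and Banks' Model

Vote shares                        Governing coalition      Policy
$w_\alpha > w_\beta > w_\gamma$    $\alpha$ and $\gamma$    $y_\alpha^*$ (cases 1-3 above)
$w_\beta > w_\alpha > w_\gamma$    $\beta$ and $\gamma$     $p_\beta$
$w_\beta > w_\gamma > w_\alpha$    $\beta$ and $\alpha$     [illegible]
$w_\alpha > w_\gamma > w_\beta$    $\alpha$ and $\beta$     [illegible]
$w_\gamma > w_\alpha > w_\beta$    $\gamma$ and $\beta$     [illegible]
$w_\gamma > w_\beta > w_\alpha$    $\gamma$ and $\alpha$    [illegible]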


A key point of Austen-Smith and Banks' model is that the composition
of the government and the policies it implements are driven by
the voting weights, which determine the sequence of proposals. Since
these weights are determined by voting behavior, the key to making
predictions is an understanding of how voters behave in anticipation of
the parliamentary bargaining. We refer readers to the original for
an analysis of the voting game.


5. Exercises 


EXERCISE 7.1. Diane is collecting money for the Center for the
Study of Democratic Politics coffee fund. She needs to collect $2 from
at least three faculty members to operate the fund for the month. No
member can contribute more than $2 and Diane cannot exclude
non-contributors from drinking coffee. Each center member has an
estimated $10 benefit from coffee service. If less than $6 is contributed,
Diane keeps the money and no coffee is provided. If more than $6 is
contributed, Diane provides the coffee and pockets the difference.


a. Assume that Diane decides to ask the faculty members in the 
following order: Arnold, Bartels, Lewis, Prior, and Romer. 
Assume that each faculty member can observe who has
contributed and who hasn't. What is the set of Nash equilibria
to this game? What is the unique Nash equilibrium that
survives backward induction?

b. Now modify the game somewhat so that Lewis, Prior, and 
Romer do not know whether or not Arnold and Bartels con- 
tributed. Further suppose that Lewis, Prior, and Romer must 
decide simultaneously. Draw a game tree for this game in 
extensive form. Pay particular attention to the information 
sets. What are the sub-game perfect equilibria to this game? 
Does Diane do better or worse in this game according to her 
personal payoffs? 

c. Now let Diane choose the information structure of the game 
i.e. she can choose which contribution decisions are revealed 
at which stage. Suppose she wants to maximize her payoffs. 
Which game should she choose? 


EXERCISE 7.2. Vote Buying (This exercise is based on Groseclose
(1996)). Assume that there are $N$ legislators with policy preferences
$u_i(x)$ for $x \in \mathbb{R}$. They must vote for bill $x_B$ against the
status quo $x_0$ where $x_B > x_0$. So let $a_i = u_i(x_B) - u_i(x_0)$ be
the degree of preference for $x_B$ over $x_0$ for legislator $i$. We
assume that each legislator's policy payoff for voting for the bill is
$a_i$ whether or not the bill passes. We also assume that there are two
vote buyers $L$ and $R$ with net preference parameters $a_L$ and $a_R$
respectively. Vote buyer $L$ wants to defeat $x_B$ so that $a_L < 0$ while
$R$ wants to pass it as $a_R > 0$. Consider the following model. $R$ moves
first and offers $z_i^R$ to each legislator who agrees to vote for the
bill. $L$ moves second and offers $z_i^L$ to each legislator in exchange
for voting against $x_B$. Thus, the payoff for voting in favor of $x_B$
is $a_i + z_i^R$ while the payoff for voting against is $-a_i + z_i^L$.

a. Assume that N = 5 and that a; = —3+72 fori = 1,5. 
Characterize the subgame perfect Nash equilibria to this game
for various levels of $a_L$ and $a_R$.

b. Can there be greater than minimum winning coalitions? 

EXERCISE 7.3. Derive the policy outcome and governing coalition
for the remaining cases of the Austen-Smith and Banks model.


CHAPTER 8 


Dynamic Games of Incomplete Information 


In chapter 6, we saw that uncertainty about the preferences (pay- 
offs) of others fundamentally alters the strategic situation players face 
in static normal form games. The implications of this in dynamic, 
multi-stage games lead to even more interesting strategic possibilities. 
Consider the deterrence game of chapter 7. 


Insert Figure 8.1 Here 


Recall that the unique subgame perfect equilibrium is { Initiate, Acquiesce}. 
Suppose however the game is changed to the following: 


Insert Figure 8.2 Here 


Now the subgame perfect Nash Equilibrium is {Do Not Initiate, Escalate}. 
But what happens if the players do not know which game they are play- 
ing? We now consider games in which players face uncertainty about 
qualities of some of the other players. Games of this form are called 
games of incomplete information. Just as in chapter 6, we can 
model such uncertainty with the Harsanyi maneuver. The perspective
is that uncertainty about the payoffs of other players can be interpreted
as playing a game in which players are not certain about what history
they are at. This trick involves the use of a fictitious player, Nature,
that randomly selects players' types from a distribution which is known
to the players. If we want to model a setting where player $i$ does not
know player $j$'s preferences at a particular time, then we assume that
Nature chooses player $j$'s payoffs (type) prior to player $i$'s decision
and we treat player $i$ as facing an information set with multiple nodes
since she did not see which type of player $j$ was drawn by Nature. This
trick converts games of incomplete information ("I don't know the game")
to games of imperfect information ("I know the game but I don't know
what Nature did").

In principle, analysis of games of incomplete information strains
our ability to design satisfactory notions of equilibrium. But Mertens
and Zamir (1985) have shown that, subject to some very technical
conditions, any description of incomplete information can be
characterized as a Bayesian game, though one with a potentially very
large type space. Following this conclusion, applied game theoretic
models of incomplete information begin with a description of the
incomplete information game as a game of imperfect information, with
uncertainty about players showing up as uncertainty about the
realization of Nature's randomization.¹

To make sense of this discussion, we return to our example. Suppose
that Nature chooses game I with probability $p$ and game II otherwise.
Now consider the following information structures.


(1) Suppose neither player observes Nature's move as in Figure
8.3. Then we say that information is imperfect but symmetric
in that both players are in the same situation. This game is
easy to analyze since all we need to do is compute country B's
expected utility of escalation and modify the game accordingly.
B's expected utility of escalation is $-8p - 3(1 - p) = -3 - 5p$,
which is decreasing in $p$, so B prefers escalation only when $p$
is below a critical value. If $p$ is below that value, the
outcome will be {Do Not Initiate, Escalate}; otherwise it
will be {Initiate, Acquiesce}.

Insert Figure 8.3 Here 


(2) Suppose that only B observes Nature's choice. This situation
is depicted in Figure 8.4. This information structure implies
that A will be uncertain of B's choice. Since B only escalates
in game II, A's expected utility from initiating is
$4p - 8(1 - p) = -8 + 12p$. Thus, A prefers initiating only if
$p > \frac{2}{3}$.

Insert Figure 8.4 Here 


(3) Suppose that only player A observes Nature's move as in
Figure 8.5. Now the game has asymmetric information. This
changes the strategic situation dramatically. While B doesn't
know Nature's choice, it knows that A knows it. Thus, A must
consider what information her choices provide about Nature's
draw. To see how these informational incentives affect behavior,
consider the seemingly natural way of playing the game
where A initiates in game I, but not in game II. If A played
these strategies, B would be able to infer from A's initiation
that they are playing game I and should acquiesce. However,
if B responded in this way, A would have a strong incentive
to defect by initiating even in game II.
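
The expected utility calculations in the first two information structures are easy to reproduce. This is a minimal sketch using only the payoffs quoted in the text. Because the figures are not reproduced here, A's payoff from not initiating is set to 0 as an assumption; under that assumption A's cutoff is $p = \frac{2}{3}$.

```python
from fractions import Fraction

def b_escalation_utility(p):
    """B's expected utility of escalating when nobody observes Nature:
    -8 in game I (probability p) and -3 in game II."""
    return -8 * p - 3 * (1 - p)

def a_initiation_utility(p):
    """A's expected utility of initiating when only B observes Nature:
    B acquiesces in game I (A gets 4) and escalates in game II (A gets -8)."""
    return 4 * p - 8 * (1 - p)

# Assumption: A's payoff from not initiating is 0 (not taken from the figures).
STATUS_QUO_PAYOFF_A = Fraction(0)

def a_initiates(p):
    """True when initiating beats the assumed status quo payoff for A."""
    return a_initiation_utility(p) > STATUS_QUO_PAYOFF_A

half = Fraction(1, 2)
cutoff = Fraction(2, 3)
```

Exact fractions are used so that the indifference point at `cutoff` is not blurred by floating-point rounding.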


¹ In this book and nearly all applied game theory, multiperson decision problems
with incomplete information are converted into games of imperfect information
using Harsanyi's trick of letting nature select types from a well defined set with
common beliefs. This means that uncertainty about the preferences of players is
no different (theoretically speaking) than uncertainty introduced by simultaneous
moves or hidden actions.

Insert Figure 8.5 Here


In this chapter, we focus on the strategic use of information in 
dynamic settings. As we will see, incomplete information raises a 
number of important issues. 


• Strategic Use of Information: Do any of the players have
a strategic advantage based on how information is allocated?
In many games, informed players have important advantages.
However, we will see that often the uninformed player is
advantaged.

• Learning: Can the uninformed players get more information
from observing the actions of the informed players? How do
these possibilities affect the strategies of the informed players?

• Signaling: Can the informed players credibly communicate
information about the game to the uninformed players? Can
informed players mislead uninformed players?


1. Perfect Bayesian Equilibria 


In dynamic games with imperfect information, players are often
uncertain as to which histories (including Nature's move) have been
reached at the point at which they move. While subgame perfection
can rule out some unreasonable Nash equilibria, in many extensive
form games with imperfectly observed actions a stronger equilibrium
concept is needed. Consider the extensive form game depicted in
Figure 8.6. Player 1 chooses whether to secretly deploy military
capability to attack an island. She can either not deploy any ships
(ND), send a small fleet of ships (S), or send a big line of ships (B).
Player 2 can only observe whether there was a deployment, as she can
see the ships coming, but cannot determine how many ships are coming.
If no deployment occurs then the payoffs are (0, 5) as player 2 keeps
the island. If there is a deployment, then player 2 must decide whether
to respond to the attack (R). If there is no response (NR) then player
1 wins the island. If there is a response, then player 2 wins the island
but the casualties for player 2 are much higher under S than under B.
The casualties for player 1 are higher under B than under S.


Insert Figure 8.6 Here 


There are three Nash equilibria to this game. The first is (ND, R).
This means that player 1 does not deploy, but if she did player 2 would
respond. The second Nash equilibrium is (B, NR). Player 1 deploys
a big line of ships, and player 2 does not respond. The profile (S, NR)
is also an equilibrium.

There is something very perplexing about the first Nash equilibrium.
Regardless of whether B or S is played, player 2 is better off playing
NR. Shouldn't player 1 recognize this and send the ships? In the last
chapter, we used subgame perfection to get us out of such conundrums,
but that won't help us here. Since this game has no proper subgames,
(ND, R) is also a subgame perfect Nash equilibrium.

Our argument that this profile is not reasonable is based on the 
idea that player 1 should anticipate a rational response from player 2 
at player 2’s information set. We incorporate this type of sequential 
rationality into an equilibrium concept, by requiring that at each 
information set agents form beliefs about which history they have 
reached and select best responses given these beliefs. These equilibria 
are called Perfect Bayesian Equilibria or PBE for short. 

Returning to the example, we can see that no belief about the 
history of play at Player 2’s information set justifies the selection of R 
as a best response. Player 2 has to believe that either S or B ships 
have been deployed. In either case, she is better off choosing NR. 

Our example leans toward the trivial side of the spectrum so con- 
sider a slight modification. In this game player 1 can only win the 
island if she selects B. Moreover, player 2 would rather defend the 
island if player 1 has selected S. Figure 8.7 depicts the relevant payoffs. 


Insert Figure 8.7 Here 


In this version whether R or NR is sequentially rational depends
on what beliefs player 2 assigns to the two possible histories in her
information set. If she believes that S was played then R is sequentially
rational. Conversely, if she believes that B was played then NR is
sequentially rational. What should she believe? Clearly, her beliefs
are based on expectations about what player 1 does. But player 1's
choice will depend on what she expects player 2 to believe. How do
we close this loop?


1.1. Formal Definitions. In this section we present the tech- 
niques needed to analyze games of this form. We now define the 
concepts needed to characterize PBE. We start with beliefs over his- 
tories. 


DEFINITION 8.1. Given an extensive form game with imperfectly
observed actions, $\Gamma^{EI}$, a belief on information set $I_j \in I$
is a probability distribution on $I_j$. A belief profile is a mapping
$b : H \to [0, 1]$ such that for every $I_j \in I$, $b(\cdot)$ is a
belief on $I_j$ (that is, for every $I_j \in I$,
$\sum_{h \in I_j} b(h) = 1$ if $I_j$ is finite and
$\int_{h \in I_j} db(h) = 1$ if $I_j$ is not finite).

So in the examples above, a belief on player 2's information set is
a probability distribution over {S, B}. We use the term belief profile
to describe a complete list of beliefs for all information sets. Since
only one player makes a decision at each information set, there is no
ambiguity about whose beliefs are relevant on each portion of the belief
profile. If player $i$ is called to make a choice at information set
$I_j$ then the portion of the belief profile which describes the belief
at information set $I_j$ describes player $i$'s belief at information
set $I_j$.

Given a belief profile, we can define a condition on strategies known
as sequential rationality. Loosely speaking, sequential rationality
requires that all strategies be optimal at each information set given a
belief profile. To formalize this notion, let $p(I_j)$ denote the player
and $s(I_j)$ denote the action called for at information set $I_j$.
These terms are equivalent to $p(h)$ and $s(h)$ when $h \in I_j$. For a
fixed strategy profile $s(\cdot)$ we denote the expected utility to
player $p(I_j)$ associated with the choice $a$ at history $h$ by
$Eu_{p(h)}(a, h, s(\cdot))$. This is an expected utility (as opposed to
a utility) because players other than $p(h)$ may play mixed strategies.
When player $p(I_j)$ assigns probability $b(h)$ to being at history
$h \in I_j$ conditional upon being at the information set $I_j$, the
expected utility to taking action $a$ at information set $I_j$ is

$$Eu_{p(I_j)}(a, I_j, s(\cdot), b(\cdot)) = \sum_{h \in I_j} b(h)\, Eu_{p(h)}(a, h, s(\cdot)).$$

A strategy profile is sequentially rational relative to a belief if it
involves optimal actions at each information set, when players evaluate
the desirability of action $a$ using
$Eu_{p(I_j)}(a, I_j, s(\cdot), b(\cdot))$. Note that when there is an
infinite set of possible histories, the summation is replaced with
integration.


DEFINITION 8.2. Given an extensive form game with imperfectly
observed actions, $\Gamma^{EI}$, and a belief $b(h)$ on each information
set, the strategy profile $s(\cdot)$ is sequentially rational (relative
to the beliefs) at information set $I_j$ if given any available action
$s'$ we have

$$Eu_{p(I_j)}(s(I_j), I_j, s(\cdot), b(\cdot)) \geq Eu_{p(I_j)}(s', I_j, s(\cdot), b(\cdot)).$$

If the strategy profile is sequentially rational (relative to the beliefs)
at every information set, then it is sequentially rational (relative
to the beliefs).


Returning to the example in Figure 8.7 above, if the beliefs assign a 
probability close to 1 on S then R is sequentially rational at the infor- 
mation set. Similarly if player 2 believes B then NR is a sequentially 
rational response. 
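
To illustrate the definition, the sketch below evaluates player 2's two actions at her information set for a given belief. Since Figure 8.7 is not reproduced here, player 2's payoffs are taken from the indifference condition that appears in the mixed strategy calculation later in this section (R yields -5 against B and 2 against S, and NR yields 0); treating those numbers as the figure's payoffs is an assumption, and the function names are ours.

```python
from fractions import Fraction

# Player 2's payoff for each (action, history) pair; assumed from the
# indifference condition q(-5) + (1 - q)2 = q(0) + (1 - q)0 in the text.
U2 = {("R", "B"): -5, ("R", "S"): 2, ("NR", "B"): 0, ("NR", "S"): 0}

def expected_utility(action, belief_B):
    """Expected utility of an action when history B has probability belief_B."""
    return belief_B * U2[(action, "B")] + (1 - belief_B) * U2[(action, "S")]

def sequentially_rational_actions(belief_B):
    """Set of actions that maximize expected utility given the belief."""
    values = {a: expected_utility(a, belief_B) for a in ("R", "NR")}
    best = max(values.values())
    return {a for a, v in values.items() if v == best}

believes_S = sequentially_rational_actions(Fraction(0))
believes_B = sequentially_rational_actions(Fraction(1))
indifferent = sequentially_rational_actions(Fraction(2, 7))
```

Under these payoffs R is optimal when the belief on B is below 2/7, NR is optimal above it, and both actions are optimal at exactly 2/7.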

We now consider a condition on beliefs known as consistency.
Consistency essentially requires that agents use Bayes' rule to
formulate their beliefs at $I_j$ whenever possible.

Recall that Bayes' rule provides us with the probability that event $A$
occurs conditional on the occurrence of $B$. It is given by

$$\Pr(A \mid B) = \frac{\Pr(A \cap B)}{\Pr(B)} = \frac{\Pr(B \mid A)\Pr(A)}{\Pr(B)}.$$

Consequently, consistency requires that agents compute the probability
of a particular history $h \in I_j$ conditional on reaching $I_j$. Of
course, the probability of reaching $I_j$ depends on the strategy profile
that the players are using. So we use $\Pr(I_j \mid s(\cdot))$ to denote
the probability that $I_j$ is reached conditional on the strategy profile
$s(\cdot)$. Secondly, note that since $h$ is assumed to be an element of
$I_j$ and therefore by definition no other information set, the
probability that $h$ and $I_j$ are both reached under strategy $s(\cdot)$
is simply the probability that $h$ is reached. We denote this probability
as $\Pr(h \mid s(\cdot))$.

Therefore, Bayes' rule implies that the probability of reaching history
$h \in I_j$ conditional on reaching information set $I_j$ under strategy
profile $s(\cdot)$ is

$$\Pr(h \mid I_j, s(\cdot)) = \frac{\Pr(h \mid s(\cdot))}{\Pr(I_j \mid s(\cdot))}.$$

Thus, weak consistency is the requirement that
$b(h) = \Pr(h \mid I_j, s(\cdot))$ whenever possible.


DEFINITION 8.3. Given an extensive form game with imperfectly
observed actions, $\Gamma^{EI}$, and a strategy profile $s(\cdot)$ we say
that the beliefs $b(\cdot)$ are weakly consistent relative to strategy
$s(\cdot)$ if $b(h) = \Pr(h \mid I_j, s(\cdot))$ whenever
$\Pr(I_j \mid s(\cdot)) > 0$.


Of course, if $I_j$ is not reached under $s(\cdot)$ then
$\Pr(I_j \mid s(\cdot)) = 0$ so that Bayes' rule is undefined. Thus,
consistency places no requirements on beliefs on "out of equilibrium"
information sets. This weakness is sometimes problematic, as we will
see. Combining weak consistency of beliefs and sequential rationality of
strategies yields the equilibrium concept PBE.
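
Weak consistency can be sketched as a small function that applies Bayes' rule when the information set is reached and otherwise reports that no restriction applies. The function name and the example reach probabilities are hypothetical and serve only as an illustration.

```python
from fractions import Fraction

def weakly_consistent_beliefs(reach_prob):
    """Map each history h in an information set to b(h) = Pr(h | I_j, s).

    reach_prob maps each history to Pr(h | s). Returns None when the
    information set is reached with probability zero, because Bayes' rule
    is then undefined and weak consistency imposes no restriction.
    """
    total = sum(reach_prob.values())  # Pr(I_j | s)
    if total == 0:
        return None
    return {h: Fraction(p) / total for h, p in reach_prob.items()}

# Hypothetical strategy for player 1: ND with probability 1/4,
# S with probability 1/2, and B with probability 1/4.
on_path = weakly_consistent_beliefs({"S": Fraction(1, 2), "B": Fraction(1, 4)})

# If player 1 plays ND for sure, player 2's information set is unreached.
off_path = weakly_consistent_beliefs({"S": 0, "B": 0})
```

The `None` return value marks exactly the situation in which any belief is weakly consistent.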


DEFINITION 8.4. Given an extensive form game with imperfectly
observed actions, $\Gamma^{EI}$, a perfect Bayesian equilibrium (PBE) is
a pair $(s(\cdot), b(\cdot))$ such that: (1) the strategy profile
$s(\cdot)$ is sequentially rational relative to the belief $b(\cdot)$,
and (2) the belief $b(\cdot)$ is weakly consistent relative to the
strategy profile $s(\cdot)$.

Thus a PBE requires the construction of beliefs. The existence of
beliefs allows us to define a notion of sequential rationality (optimality
of choices at histories). Moreover, the beliefs that players entertain
are related to the equilibrium strategies, in that histories which are
relatively more likely to be reached under a strategy profile are
believed to occur with a higher probability. It should not be surprising
that this notion of equilibrium has a certain circularity to it. Recall
that Nash equilibrium requires that strategies are individually best
responses given a conjecture of other players' strategies and that the
conjecture turn out to be correct. Similarly, PBE requires that
strategies be best responses given beliefs which depend on the
conjectured strategies of others, that the beliefs are reasonable given
the strategies, and that the conjecture about strategies be correct.

Returning to the game in Figure 8.6, we can now consider what
strategy profiles occur in a PBE. Clearly the Nash equilibrium (ND, R)
is not supportable as a PBE, because for any beliefs about which history
(S or B) player 2 is at when her information set is reached, NR is the
unique response that is sequentially rational for 2 at the information
set. Now given that player 2 is choosing NR, player 1's optimal
choice is to play either S or B. Now if player 1 chooses B then
consistent beliefs must assign probability 1 to player 2 being at history
B. Thus, one PBE is (B, NR), Pr(B) = 1, where Pr(B) is the posterior
probability of B given that player 2's information set is reached under
player 2's beliefs. Similarly there is a PBE of the form (S, NR),
Pr(B) = 0.

Now consider the game in Figure 8.7. If player 2 believes that
Pr(B) = 1 then NR is the best response. On the other hand if player
2 believes that Pr(B) = 0 then R is the best response. One candidate
for a PBE is (ND, R), Pr(B) = 0. Note that since no constraint is
imposed on beliefs over the histories B and S when player 1 plays ND,
the belief Pr(B) = 0 is consistent relative to the strategy ND. But the
strategy profile (ND, R) is not sequentially rational, as player 1 would
prefer to play B rather than ND when she conjectures that player 2 is
playing R. It is also clear that ND cannot be a best response to NR.

Alternatively we can try to characterize a pure strategy PBE in
which ND is not played. If B is played and beliefs are consistent the
only sequentially rational strategy by 2 will involve NR. But if player
1 conjectures that player 2 is playing NR she will want to play S. So
we cannot have B played in a pure strategy PBE. On the other hand
if S is played then consistent beliefs must assign probability 1 to player
2 being at this history. Thus, the only sequentially rational action will
involve playing R. But if player 1 conjectures that player 2 is playing
R then she will want to play B. Thus we cannot have a pure strategy
PBE in which S is played. We have thus shown that there is no pure
strategy PBE to the game.

It is not difficult to characterize the mixed strategy PBE to the
game. Suppose that player 1 plays B with probability $q$ and S with
probability $(1 - q)$. Further suppose that player 2 plays R with
probability $z$ and NR with probability $(1 - z)$. Consistency of beliefs
requires that Pr(B) = $q$. Now for player 2 to be indifferent between
R and NR it must be the case that

$$q(-5) + (1 - q)2 = q(0) + (1 - q)0.$$

This requires that $q = \frac{2}{7}$. Now in order for player 1 to be
indifferent between playing B and S it must be the case that

$$(1 - z)5 + z(-2) = (1 - z)4 + z3.$$

This requires that $z = \frac{1}{6}$. Accordingly the strategy profile,
S with probability $\frac{5}{7}$, B with probability $\frac{2}{7}$, R
with probability $\frac{1}{6}$ and NR with probability $\frac{5}{6}$, is
supportable as a PBE, with the beliefs Pr(B) = $\frac{2}{7}$.
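
The two indifference conditions are linear in the mixing probabilities, so they can be solved exactly. The helper below is a sketch with a hypothetical name; its inputs are the payoff numbers that appear in the two equations above.

```python
from fractions import Fraction

def indifference_prob(x_first, x_second, y_first, y_second):
    """Solve t*x_first + (1-t)*x_second = t*y_first + (1-t)*y_second for t.

    t is the probability the opponent puts on her 'first' action; x and y
    are the two actions the indifferent player is choosing between.
    """
    numerator = y_second - x_second
    denominator = (x_first - x_second) - (y_first - y_second)
    return Fraction(numerator, denominator)

# Player 2 compares R (-5 against B, 2 against S) with NR (0 against both);
# q is the probability that player 1 plays B.
q = indifference_prob(-5, 2, 0, 0)

# Player 1's condition (1 - z)5 + z(-2) = (1 - z)4 + z3;
# z is the probability that player 2 plays R.
z = indifference_prob(-2, 5, 3, 4)
```

Here `q` evaluates to 2/7 and `z` to 1/6, so the complementary probabilities on S and NR are 5/7 and 5/6.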


1.2. Signaling Games. An important class of games of imperfect
information involves asymmetric information with the more informed
agent, the sender, moving first followed by the less informed receiver.
These games take their name from the possibility that the sender's
action will convey information about her type to the receiver. We
begin with the simplest possible signaling game to demonstrate some
of the potential incentives faced by the sender.

Let Nature draw a type $\theta \in \{a, b\}$ for player 1. Player 1
observes her type and chooses a "message" $m \in \{a, b\}$. Player 2
observes the message but does not observe player 1's type. Following the
message, player 2 chooses a "policy" $p \in \{a, b\}$. The payoffs to
each player from a policy-type pair are denoted $u_i(p, \theta)$.
Figure 8.8 depicts the game form.


Insert Figure 8.8 Here 


Here the non-trivial information sets involve moves by player 2 and 
are represented by the dotted lines. When player 2 makes a policy 
selection, she knows what message was sent by player 1, but she does 
not know which state has been chosen by nature. 

Assume that $u_2(p, \theta) > u_2(p', \theta)$ and
$u_2(p', \theta') > u_2(p, \theta')$ so that player 2 would like to know
$\theta$ before selecting $p$. We can motivate this assumption with a
simple interpretation: the value of $\theta$ affects the desirability of
each policy, and player 1 wants to match $\theta$ and $p$ while player 2
wants the pair to be unmatched ($\theta \neq p$). An example is a
legislature (agent 2) choosing between two policy alternatives $a$ and
$b$ that are both risky but desirable in expectation. An informed
expert (agent 1) gives unverifiable testimony before Congress about the
relative risks of the two alternatives and $\theta$ denotes the identity
of the policy which is actually more risky. If the legislature is more
risk averse than the expert then the preference profile described above
is appropriate. In this game player 1's action is called a cheap talk
speech because agent 2 cannot verify the accuracy of 1's speech, and
there is no explicit cost to lying. Our assumption that 1 wants to match
$p$ and $\theta$ while 2 does not is consistent with the following
ordering of payoffs.

$$u_1(b, a) < u_1(a, a)$$
$$u_1(a, b) < u_1(b, b)$$
$$u_2(b, a) > u_2(a, a)$$
$$u_2(a, b) > u_2(b, b)$$

To make the description one of imperfect information, we need to
further specify a pair of prior beliefs over the state $\theta$. Suppose
that $\theta = a$ with probability $\pi > \frac{1}{2}$. One natural
question to ask is whether there is a PBE in which player 1, the informed
player, reveals her private information. This would require that she use
one of the following strategies

$$m(\theta) = \begin{cases} a & \text{if } \theta = a \\ b & \text{if } \theta = b \end{cases}$$

or

$$m(\theta) = \begin{cases} b & \text{if } \theta = a \\ a & \text{if } \theta = b. \end{cases}$$


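We begin by focusing on the first message strategy. If player 1 uses
this strategy, consistency of beliefs requires that

$$b(\theta = a \mid m = a) = \frac{\pi \cdot 1}{\pi \cdot 1 + (1 - \pi) \cdot 0} = 1$$

and

$$b(\theta = a \mid m = b) = \frac{\pi \cdot 0}{\pi \cdot 0 + (1 - \pi) \cdot 1} = 0.$$

Given these beliefs sequential rationality requires that agent 2 select
policy according to the following mapping

$$p(m) = \begin{cases} b & \text{if } m = a \\ a & \text{if } m = b. \end{cases}$$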


The last thing to check is whether the specified $m(\cdot)$ strategy is
sequentially rational. Here the critical question is whether it
represents a best response to the mapping $p(\cdot)$. Note that if
$m = a$ the policy is $b$ and if $m = b$ the policy is $a$. Since
$u_1(b, a) < u_1(a, a)$ and $u_1(a, b) < u_1(b, b)$, a player 1 that has
observed $\theta = a$ can deviate from the strategy and announce $m = b$,
which will result in the outcome $a$. This outcome is more desirable
than the outcome of playing the conjectured strategy. Similarly, if
player 1 observed $\theta = b$, she can gain by deviating and announcing
$m = a$. This argument demonstrates that there is no PBE in which the
"truthful" message strategy $m(\theta) = \theta$ is deployed. It is
left as an exercise to show that there cannot be a PBE in which the
second message strategy listed above is used.
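
The deviation argument can be replayed mechanically. The sketch below uses hypothetical 0/1 payoffs that satisfy the ordering assumed above (the sender gets 1 when policy matches the state, the receiver gets 1 when it does not); the function names are ours and the numbers are not taken from the text.

```python
TYPES = MESSAGES = POLICIES = ("a", "b")

def u1(p, theta):
    """Hypothetical sender payoff: 1 if the policy matches the state."""
    return 1 if p == theta else 0

def u2(p, theta):
    """Hypothetical receiver payoff: 1 if the policy differs from the state."""
    return 1 if p != theta else 0

def receiver_best_reply(message_strategy):
    """Receiver's policy after each message under a separating strategy,
    where consistent beliefs put probability 1 on the type that sends it."""
    reply = {}
    for theta in TYPES:
        m = message_strategy[theta]
        reply[m] = max(POLICIES, key=lambda p: u2(p, theta))
    return reply

def sender_has_profitable_deviation(message_strategy):
    """True if some type gains by sending the other message."""
    reply = receiver_best_reply(message_strategy)
    for theta in TYPES:
        on_path = u1(reply[message_strategy[theta]], theta)
        if any(u1(reply[m], theta) > on_path for m in MESSAGES):
            return True
    return False

truthful = {"a": "a", "b": "b"}
truthful_reply = receiver_best_reply(truthful)
truthful_fails = sender_has_profitable_deviation(truthful)
```

For the truthful strategy the receiver answers each message with the opposite policy, so each sender type prefers to announce the other state.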

The message strategies defined above are called separating because
if they are used by the sender then the receiver learns the type
$\theta$. We showed that if the truthful message strategy is used then
consistency of beliefs requires that the beliefs are deterministic.

While the game does not possess a separating equilibrium, there are
other possible PBE. Suppose that the sender sends the same message
(say $a$) regardless of $\theta$. Thus $m(\theta) = a$. Given this
message strategy, the receiver learns nothing from observing $m$. In
this case consistency of beliefs requires that

$$b(\theta = a \mid m = a) = \frac{\pi \cdot 1}{\pi \cdot 1 + (1 - \pi) \cdot 1} = \pi.$$


What beliefs should the receiver form following an $m = b$? This
question is tricky. If we try to use Bayes’ rule we get

$$b(\theta = a \mid m = b) = \frac{\pi \cdot 0}{\pi \cdot 0 + (1 - \pi) \cdot 0} = \frac{0}{0}.$$

Since $\frac{0}{0}$ is not a number, Bayes’ rule is not well defined.
This is because Bayes’ rule conditions on a history that occurs with
zero probability given the strategy profile. The definition of weak
consistency only requires that beliefs obey Bayes’ rule when the
denominator is greater than zero. Accordingly, in our efforts to
characterize an equilibrium with the message strategy $m(\theta) = a$,
the complication of defining $b(\theta = a \mid m = b)$ is not
insurmountable. Since weak consistency imposes no constraints on this
belief, we are allowed to specify this posterior in whatever manner is
required to help us construct equilibria. Let’s say that
$b(\theta = a \mid m = b) = \pi$. Given this specification of beliefs,
the question of what receiver strategy is sequentially rational
requires simply comparing expected utilities. Policy $a$ is more
desirable if


$$\pi u_2(a, a) + (1 - \pi) u_2(a, b) > \pi u_2(b, a) + (1 - \pi) u_2(b, b).$$

Otherwise policy $b$ is more desirable. Accordingly, if

$$\pi \geq \frac{u_2(b, b) - u_2(a, b)}{u_2(a, a) - u_2(a, b) - u_2(b, a) + u_2(b, b)} \tag{8.1}$$

then given these beliefs sequential rationality of $p$ is satisfied by
$p(m) = b$. If the inequality in equation 8.1 is reversed then
sequential rationality of $p$ is satisfied with $p(m) = a$. Finally, we
must check that the message strategy is sequentially rational given the
policy function. This step is trivial. Because the policy function is
constant in $m$, any message function is a best response. Accordingly,
we have found a pooling equilibrium of the signaling game.
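The receiver’s side of the pooling equilibrium reduces to one cutoff on the belief that $\theta = a$. Below is a minimal Python sketch, assuming a hypothetical payoff table `U2` for the receiver (indexed by policy and state) that respects the ordering in the text. The cutoff is the one obtained by rearranging the expected-utility comparison above, and the function names are our own.

```python
# Hypothetical receiver payoffs indexed by (policy, state).
U2 = {("a", "a"): 0.0, ("b", "a"): 1.0, ("a", "b"): 2.0, ("b", "b"): 0.0}


def receiver_policy(belief_a, u2):
    """Sequentially rational policy when Pr(theta = a) equals belief_a."""
    eu_a = belief_a * u2[("a", "a")] + (1 - belief_a) * u2[("a", "b")]
    eu_b = belief_a * u2[("b", "a")] + (1 - belief_a) * u2[("b", "b")]
    return "a" if eu_a > eu_b else "b"


def belief_cutoff(u2):
    """Beliefs at or above this value make policy b optimal."""
    numerator = u2[("b", "b")] - u2[("a", "b")]
    denominator = (u2[("a", "a")] - u2[("a", "b")]
                   - u2[("b", "a")] + u2[("b", "b")])
    return numerator / denominator
```

With these assumed numbers the cutoff is 2/3, so a prior of 0.8 leads the receiver to choose $b$ after every message, while a belief of 0.5 would lead to $a$.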

Returning to the question of what $b(\theta = a \mid m = b)$ should be,
we can see that the answer to this question is quite important. Assume
that $\pi$ is sufficiently big so that the receiver’s best response is
$b$. Suppose now that $b(\theta = a \mid m = b) = 0$. In this case
sequential rationality requires that $p(b) = a$. So changing the
off-the-path beliefs, together with the requirement of sequential
rationality, means that the off-the-path policy action must change. Can
we construct an equilibrium with these new beliefs and policy function
that uses the pooling message function $m(\theta) = a$? Consider a
sender who observes $\theta = a$. In the conjectured equilibrium she is
supposed to send message $a$. The receiver learns nothing from the
message and selects $b$ because $a$ is more likely ($\pi$ is high). But
now, if our sender deviates from this conjectured strategy and sends
the message $b$, the receiver’s response is to select policy $a$. If
this conjectured strategy and belief profile is a PBE it must be the
case that the sender never has an incentive to deviate in this way.
However, when $\theta = a$ the sender is better off deviating and
eliciting $p = a$ instead of $p = b$. Thus, while the pooling message
function was supportable in a PBE with some consistent beliefs, it is
not supportable as a PBE with every set of consistent beliefs. The
off-the-path beliefs matter as they create incentives for on-the-path
behavior.
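To see how the off-the-path belief decides whether pooling on message $a$ survives, one can test both specifications side by side. This is a minimal Python sketch under assumed payoffs (the 0/1 values in `U1` and `U2` are hypothetical and only their ordering follows the text); `pooling_on_a_is_pbe` is an illustrative helper, not a general PBE solver.

```python
# Hypothetical payoffs indexed by (policy, state).
U1 = {("a", "a"): 1.0, ("b", "a"): 0.0, ("a", "b"): 0.0, ("b", "b"): 1.0}
U2 = {("a", "a"): 0.0, ("b", "a"): 1.0, ("a", "b"): 1.0, ("b", "b"): 0.0}


def best_policy(belief_a):
    """Receiver's optimal policy given Pr(theta = a) = belief_a."""
    def expected_utility(policy):
        return (belief_a * U2[(policy, "a")]
                + (1 - belief_a) * U2[(policy, "b")])
    return "a" if expected_utility("a") > expected_utility("b") else "b"


def pooling_on_a_is_pbe(prior, off_path_belief):
    """Check that no sender type gains by sending the unused message b."""
    on_path_policy = best_policy(prior)              # response to m = a
    off_path_policy = best_policy(off_path_belief)   # response to m = b
    return all(
        U1[(on_path_policy, state)] >= U1[(off_path_policy, state)]
        for state in ("a", "b")
    )
```

With a prior of 0.8, setting the off-the-path belief equal to the prior supports the pooling profile, while setting it to 0 does not, because the type who observed $\theta = a$ would then deviate.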

Another way of constructing pooling equilibria in this game is to
appeal to mixed strategies by the sender. Suppose now that regardless
of her type the sender sends message $a$ with probability
$\sigma \in (0, 1)$. Given this, Bayes’ rule yields

$$b(\theta = a \mid m = a) = b(\theta = a \mid m = b) = \pi.$$

As before, characterizing sequentially rational strategies for the
receiver hinges on $\pi$. One key difference between these pooling
equilibria with mixed messages and the case of pure message strategies
is that with mixed strategies Bayes’ rule pins down the beliefs at
every observable information set.

In more general settings we can characterize equilibria along these
lines, as separating or pooling, and introduce another category called
partially separating or partially pooling.2


DEFINITION 8.5. In a general signaling game with multiple senders
and one receiver in which each sender $i$ has a type space $\Theta_i$
and message space $M_i$, and the receiver has prior beliefs
$\pi(\cdot)$ on $\Theta = \times_i \Theta_i$, we have the following: in
a separating equilibrium, on the equilibrium path the receiver’s
posterior beliefs are concentrated at the true state. In a pooling
equilibrium, on the equilibrium path the posteriors correspond to the
priors. In a partially separating equilibrium neither of the above
conditions holds.


2. Application: Entry Deterrence in Elections 


One of the most intriguing puzzles in the study of campaign finance
is the question of why incumbent politicians exert so much effort to
raise more campaign money than seems necessary to simply finance their
campaigns. A standard explanation is that incumbents raise these sums
to deter entry by potential challengers. Thus, fundraising success is
used to signal the incumbent’s electoral strength. A formal model of
this phenomenon has been developed by Epstein and Zemsky (1995).
Suppose a challenger is deciding whether to run against an incumbent
for office, but that the challenger only wishes to run when the
incumbent is politically “weak.” If the incumbent is “strong”, the
challenger would rather sit out the race. Conventional wisdom suggests
that the incumbent may wish to signal to the challenger that she is
strong by raising a large amount of campaign monies before the
challenger decides to enter. If this “war chest” convinces the
challenger that the incumbent is strong, he may be deterred from
entering. To capture this intuition in a model, let $p_0$ be the prior
probability that the challenger ($C$) places on the incumbent ($I$)
being strong ($S$). Obviously, $1 - p_0$ is the probability that the
incumbent is weak ($W$). Both $C$ and $I$ get 1 unit of utility from
office. Let the probability that $C$ wins against $W$ be $\pi_w$, while
$C$ beats $S$ with probability $\pi_s$, where $\pi_w > \pi_s$. Let $k$
be $C$’s cost of running where, to keep things interesting, we assume
that $\pi_w > k > \pi_s$. If $k > \pi_w$, $C$ would never enter and if
$k < \pi_s$ he would always enter.

The key assumption of the model is that $S$ and $W$ have different
costs of raising a war chest. To keep things simple, we model only
whether or not the incumbent builds a war chest, but not his choice of
size. Thus, the incumbent’s strategy set is $S_I = \{WC, \neg WC\}$
where $WC$ is the decision to build a war chest and $\neg WC$ the
decision to forgo one. By $s_I \in S_I$ we denote the strategy chosen
by $I$. We assume that types $W$ and $S$ must pay $c_w$ and $c_s$
respectively to build a war chest where $c_s < c_w$. Note that the
probability that the incumbent wins depends only on his type. There is
no direct effect of the war chest except for its role in signaling the
incumbent’s strength to the challenger. After observing whether $I$
builds a war chest, $C$ decides whether to enter the race ($E$) or sit
it out ($\neg E$).

2 Those who see the glass half-empty will probably prefer the term
partially pooling. Others will find partially separating more in line
with their philosophies of life.

Figure 8.9 provides the extensive form of this game. We begin our
analysis with the last stage of the game. Clearly $C$ will only enter
if the expected benefit of entering is greater than or equal to $k$.
Thus, entry requires that
$\pi_s \Pr\{S \mid s_I\} + \pi_w \Pr\{W \mid s_I\} \geq k$.

Insert Figure 8.9 Here 


Note that the game in Figure 8.9 can also be presented in a manner
similar to Figure 8.8. We depict both approaches to familiarize
readers with different graphical presentations.


2.1. The First Period. In the first period, W and S must choose 
strategies. There are three possible types of equilibria: 


(1) Separating: $S$ and $W$ choose different actions.
(2) Pooling: $S$ and $W$ choose the same strategy.
(3) Semi-pooling: $S$ and $W$ choose different mixed strategies.


2.1.1. The Separating Equilibria. We first consider a separating equi- 
librium where the strong incumbent builds a war chest and the weak 
incumbent does not. Given these strategies, the challenger will learn 
the incumbent’s type in equilibrium and will enter only if the incum- 
bent does not build a war chest. 


PROPOSITION 8.1. If $c_s < \pi_s$ and $c_w > \pi_w$, the following
strategies and beliefs constitute a perfect Bayesian equilibrium:
$s(S) = WC$, $s(W) = \neg WC$, $s(C) = E$ if $\neg WC$ and $\neg E$
otherwise, $\Pr\{S \mid WC\} = 1$, and $\Pr\{S \mid \neg WC\} = 0$.


Given these strategies, the application of Bayes’ rule implies that
$C$’s equilibrium beliefs are $\Pr\{S \mid WC\} = 1$ and
$\Pr\{W \mid \neg WC\} = 1$. Since $\pi_w > k > \pi_s$, $C$’s strategy
to enter only in the absence of a war chest is a best response. Now, we
need to check that the incumbent’s strategies are best responses.
Incumbent $S$’s utility from $WC$ is $1 - c_s$, which is greater than
her utility of $\neg WC$, which is $1 - \pi_s$, since $c_s < \pi_s$.
Now, we check whether $s(W) = \neg WC$ is a best response. The payoff
for building a war chest is $1 - c_w$ while the utility for not
building a war chest is $1 - \pi_w$. So $\neg WC$ is a best response
since we assume that $c_w > \pi_w$.
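The best-response checks in this argument are simple enough to encode directly. The following minimal Python sketch assumes the payoffs described above (office is worth 1, and a challenged incumbent of a given type loses with probability $\pi_s$ or $\pi_w$); the function name and the sample parameter values in the note below are hypothetical.

```python
def separating_pbe_holds(c_s, c_w, pi_s, pi_w, k):
    """Check the separating profile: S builds a war chest, W does not,
    and C enters only when no war chest is observed."""
    # Challenger: beliefs are degenerate, so compare win probability to k.
    stays_out_after_wc = pi_s <= k      # believes the incumbent is strong
    enters_after_no_wc = pi_w >= k      # believes the incumbent is weak

    # Strong incumbent: war chest (no entry) versus no war chest (entry).
    strong_ok = 1 - c_s >= 1 - pi_s

    # Weak incumbent: no war chest (entry) versus war chest (no entry).
    weak_ok = 1 - pi_w >= 1 - c_w

    return (stays_out_after_wc and enters_after_no_wc
            and strong_ok and weak_ok)
```

For example, with $\pi_s = 0.2$, $\pi_w = 0.8$ and $k = 0.5$, the profile survives when $c_s = 0.1$ and $c_w = 0.9$, but fails once $c_w$ drops to 0.3 (the weak type would mimic) or $c_s$ rises to 0.3 (the strong type would not bother).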

It is worth trying to generate some intuition as to why the
conditions $c_s < \pi_s$ and $c_w > \pi_w$ are required to support a
separating equilibrium. It is clear that for a war chest to credibly
signal strength it must be substantially more costly for the weak
incumbent to build it. If this were not the case, weak incumbents would
also build war chests and it would no longer be sequentially rational
for $C$ to be deterred. But if war chests did not deter challenges,
neither type would seek to build them, destroying the PBE.

Next we show that there exist no equilibria where the signal is
reversed so that weak incumbents build war chests and strong
incumbents do not.


PROPOSITION 8.2. $s(S) = \neg WC$ and $s(W) = WC$ cannot be a
Bayesian Nash equilibrium.


To see that the claim is true, note that $C$’s best response to these
strategies would be {enter if $WC$}. Then $S$ would get a utility of 1
for $\neg WC$ and $1 - \pi_s - c_s$ for $WC$. So $S$’s strategy would
be a best response. However, consider $W$’s best response. She would
get $1 - \pi_w - c_w$ for $WC$ and 1 for $\neg WC$. Clearly,
$s(W) = WC$ is not a best response.

The reason that no such separating equilibrium exists can perhaps be
understood in terms of the concept of incentive compatibility. We say
that messages $m$ and $m'$ are incentive compatible for types $\theta$
and $\theta'$ if and only if
$EU(m \mid \theta) \geq EU(m' \mid \theta)$ and
$EU(m' \mid \theta') \geq EU(m \mid \theta')$. In other words, each
type must weakly prefer its own message to that of the other type.
Clearly, incentive compatibility is a necessary condition for a PBE.

Note that for the reversed signals to be an equilibrium, the incentive
compatibility requirements are

$$EU(\neg WC \mid S) \geq EU(WC \mid S)$$

$$EU(WC \mid W) \geq EU(\neg WC \mid W)$$

We can combine these requirements by adding the inequalities and
moving things around so that

$$EU(\neg WC \mid S) - EU(\neg WC \mid W) \geq EU(WC \mid S) - EU(WC \mid W)$$

which becomes

$$\Pr\{E \mid \neg WC\}(\pi_w - \pi_s) \geq \Pr\{E \mid WC\}(\pi_w - \pi_s) + c_w - c_s$$

Note that since $c_w > c_s$ and $\pi_w > \pi_s$, the incentive
compatibility constraints require that
$\Pr\{E \mid \neg WC\} > \Pr\{E \mid WC\}$. However, given $C$’s
beliefs, this cannot be a best response. Therefore, reversing the
signal is not a PBE.
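The combined incentive compatibility condition can be explored numerically. In the minimal Python sketch below, `x` and `y` stand for the entry probabilities after no war chest and after a war chest; the parameter values in `PARAMS` and the grid are hypothetical, chosen only to satisfy $c_w > c_s$ and $\pi_w > \pi_s$.

```python
def reversed_signal_is_ic(x, y, pi_s, pi_w, c_s, c_w):
    """x = Pr{E | no war chest}, y = Pr{E | war chest}."""
    strong_ok = 1 - x * pi_s >= 1 - y * pi_s - c_s   # S prefers no war chest
    weak_ok = 1 - y * pi_w - c_w >= 1 - x * pi_w     # W prefers the war chest
    return strong_ok and weak_ok


# Hypothetical parameters with c_w > c_s and pi_w > pi_s.
PARAMS = dict(pi_s=0.2, pi_w=0.8, c_s=0.1, c_w=0.3)
GRID = [i / 10 for i in range(11)]

# All entry-probability pairs on the grid that satisfy both constraints.
feasible = [
    (x, y) for x in GRID for y in GRID
    if reversed_signal_is_ic(x, y, **PARAMS)
]
```

Every pair in `feasible` has `x > y`, whereas the challenger’s best response to the reversed signal is `x = 0` and `y = 1`, which fails the check.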

We will see this incentive compatibility approach used much more 
extensively when we look at models of mechanism design in chapter 11. 

2.1.2. Pooling Equilibria. Now we turn to the analysis of pooling
equilibria where both types of incumbents choose the same strategy.
Generally, the easiest way to construct such equilibria is to specify
the most unfavorable beliefs possible in the event that one of the
senders chooses the out-of-equilibrium message. In this context, this
requires that $C$ believe that the incumbent is weak following an
out-of-equilibrium action. If these “pessimistic” beliefs support a
strategy profile, then that profile constitutes a PBE. However, a
number of slightly less pessimistic beliefs will also support that
profile as a PBE. So it’s a good practice to compute the set of beliefs
that support each PBE. In the current model, this requires us to
specify the largest posterior on $\Pr\{S\}$ following the defection
that supports the PBE.


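PROPOSITION 8.3. Suppose that $p_0 > \frac{\pi_w - k}{\pi_w - \pi_s}$.
The following strategies and beliefs are a perfect Bayesian
equilibrium: $s(W) = s(S) = \neg WC$; $s(C) = \{E \text{ if } WC\}$,
$\Pr\{S \mid \neg WC\} = p_0$ and
$\Pr\{S \mid WC\} < \frac{\pi_w - k}{\pi_w - \pi_s}$.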

Since $W$ and $S$ play the same strategy, Bayes’ rule implies that
$\Pr\{S \mid \neg WC\} = p_0$. So $C$’s utility from entering is

$$\pi_s p_0 + \pi_w (1 - p_0) - k = \pi_w - (\pi_w - \pi_s) p_0 - k$$

so $C$ will enter after observing $\neg WC$ only if
$p_0 < \frac{\pi_w - k}{\pi_w - \pi_s}$; since the prior exceeds this
cutoff here, he stays out. What should $C$ believe and do if he
observes $WC$? Nash equilibrium is silent about what to do off the
equilibrium path. Note that we cannot apply Bayes’ rule because the
denominator would be zero. So we assign the arbitrary beliefs
$\Pr\{S \mid WC\} < \frac{\pi_w - k}{\pi_w - \pi_s}$ after observing
$WC$. Thus, $C$ should enter if he observes $WC$. Now consider the
strategies of $S$ and $W$. They both get 1 for not building a war chest
and $1 - \pi_s - c_s$ and $1 - \pi_w - c_w$ respectively for building
one. Thus, their strategies are best responses.


PROPOSITION 8.4. Suppose that $p_0 < \frac{\pi_w - k}{\pi_w - \pi_s}$.
The following strategies and beliefs are a perfect Bayesian
equilibrium: $s(W) = s(S) = \neg WC$; $s(C) = \{E\}$,
$\Pr\{S \mid \neg WC\} = p_0$ and
$\Pr\{S \mid WC\} < \frac{\pi_w - k}{\pi_w - \pi_s}$.


Clearly, $C$’s strategy of entering is a best response since
regardless of what the incumbent does $C$ will think it unlikely that
the incumbent is $S$. On the equilibrium path, Bayes’ rule implies that
$\Pr\{S \mid \neg WC\} = p_0$. Off the equilibrium path, we are free to
assign $\Pr\{S \mid WC\} < \frac{\pi_w - k}{\pi_w - \pi_s}$. Since $C$
always enters, a war chest by either type of incumbent is a waste of
resources. Thus, neither type of incumbent builds one.


PROPOSITION 8.5. Suppose that $p_0 > \frac{\pi_w - k}{\pi_w - \pi_s}$.
The following strategies and beliefs are a perfect Bayesian
equilibrium: $s(W) = s(S) = \neg WC$; $s(C) = \{\neg E\}$,
$\Pr\{S \mid \neg WC\} = p_0$ and
$\Pr\{S \mid WC\} > \frac{\pi_w - k}{\pi_w - \pi_s}$.


$C$’s best response given the beliefs on the equilibrium path is to
stay out of the race. Suppose instead that the incumbent defected and
built a war chest. In this equilibrium, $C$ believes that it is
relatively likely that the incumbent is strong at the
out-of-equilibrium information set. So $C$ still chooses $\neg E$.
Since $C$ always stays out, it is optimal for neither incumbent to
build a war chest.


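PROPOSITION 8.6. Suppose that $p_0 > \frac{\pi_w - k}{\pi_w - \pi_s}$,
$c_s < \pi_s$, and $c_w < \pi_w$. The following strategies and beliefs
are a perfect Bayesian equilibrium: $s(W) = s(S) = WC$;
$s(C) = \{E \text{ if } \neg WC\}$, $\Pr\{S \mid WC\} = p_0$ and
$\Pr\{S \mid \neg WC\} < \frac{\pi_w - k}{\pi_w - \pi_s}$.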


Clearly, given equilibrium beliefs, $C$’s strategy is a best response.
If $C$ observes $\neg WC$, we assign a probability of $S$ sufficiently
low so that he chooses $E$ following a defection. So consider incumbent
$S$ who gets $1 - c_s$ in equilibrium but gets $1 - \pi_s$ by
defecting. So $S$ will choose $WC$ so long as $c_s < \pi_s$. Incumbent
$W$ gets $1 - c_w$ from the equilibrium and $1 - \pi_w$ from defecting
so $WC$ is a best response if $c_w < \pi_w$.


PROPOSITION 8.7. There is no equilibrium where $s(W) = s(S) = WC$
and $C$ always enters.


Both types would defect to $\neg WC$ since it does not change $C$’s
behavior.

Clearly, there are two types of pooling equilibria to this game: one
where both incumbents build war chests and one where neither does.
Note that both reveal the same amount of information to the
challenger, namely none, but differ in the costs incurred by the
incumbent. Clearly, every player prefers the PBE where both types play
$\neg WC$ to the one where they both play $WC$. Thus, we can refer to
the former as the efficient PBE and the latter as the inefficient one.
Importantly, there is nothing intrinsic to the concept of PBE to
predict which of these equilibria is more likely to be played.
However, as we will see, various authors have proposed criteria for
refining the set of PBE. Often these refinements select the efficient
PBE.
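The pooling results in Propositions 8.3 through 8.6 all turn on comparing the prior $p_0$ with a single belief cutoff. As a rough summary, here is a minimal Python sketch that lists which of those pooling profiles are available for given parameters; the function names and labels are our own, and knife-edge cases with equalities are ignored.

```python
def belief_cutoff(pi_s, pi_w, k):
    """Belief that S is strong at which C is indifferent about entering."""
    return (pi_w - k) / (pi_w - pi_s)


def pooling_equilibria(p0, pi_s, pi_w, k, c_s, c_w):
    """Labels of the pooling profiles from Propositions 8.3-8.6 that apply."""
    cutoff = belief_cutoff(pi_s, pi_w, k)
    found = []
    if p0 > cutoff:
        found.append("8.3: no war chest; C enters only after a war chest")
        found.append("8.5: no war chest; C never enters")
        if c_s < pi_s and c_w < pi_w:
            found.append("8.6: war chest; C enters only after no war chest")
    if p0 < cutoff:
        found.append("8.4: no war chest; C always enters")
    return found
```

With $\pi_s = 0.2$, $\pi_w = 0.8$ and $k = 0.5$ the cutoff is about 0.5, so a prior of 0.7 admits the profiles of Propositions 8.3 and 8.5 (and 8.6 when both costs are low enough), while a prior of 0.3 admits only the profile of Proposition 8.4.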

2.1.3. Partial Pooling. The remaining possibility is that at least
one of the incumbent types chooses a mixed strategy that occasionally
separates from the other type and occasionally pools. There are a
couple of reasons for exploring such possibilities. First of all, if
such equilibria exist, a full characterization of the set of PBEs
requires analysis of partial pooling equilibria.3 Secondly, authors
sometimes would like to analyze the most informative equilibrium, i.e.
the one in which the receiver’s posteriors are closest to the true
distribution of types. In many cases, semi-pooling equilibria will
exist for parameter values for which there are no separating
equilibria. In these cases, the partial pooling equilibria will be the
most informative.

We present only one of the partial pooling equilibria, and leave the
other possibility to the reader as an exercise. In this equilibrium,
the $S$ incumbent always builds a war chest. However, the $W$
incumbent also builds one with probability $q$.


PROPOSITION 8.8. If $\frac{c_w}{\pi_w} > \frac{c_s}{\pi_s}$, then
$s(S) = WC$, $s(W) = \{WC \text{ with prob. } q\}$,
$s(C) = \{$enter if $\neg WC$ and enter with probability $r$ if
$WC\}$ is a perfect Bayesian equilibrium.


Clearly, Bayes’ rule implies that $\Pr\{S \mid \neg WC\} = 0$ and thus
following $\neg WC$ entry is a best response. When $WC$ is observed,
Bayes’ rule is more complicated:

$$\Pr(S \mid WC) = \frac{\Pr(WC \mid S)\Pr(S)}{\Pr(WC \mid S)\Pr(S) + \Pr(WC \mid W)\Pr(W)} = \frac{p_0}{p_0 + q(1 - p_0)}$$

Since $C$ plays a mixed strategy, he must be indifferent between
entering and not entering so that

$$\pi_s \Pr\{S \mid WC\} + \pi_w \Pr\{W \mid WC\} = k$$

or

$$\frac{\pi_s p_0}{p_0 + q(1 - p_0)} + \frac{\pi_w q (1 - p_0)}{p_0 + q(1 - p_0)} = k.$$

Solving we find that

$$q^* = \frac{p_0 (k - \pi_s)}{(1 - p_0)(\pi_w - k)}.$$


When $C$ chooses $r$, it must make $W$ indifferent between building
$WC$ and not doing so. If $W$ chooses $\neg WC$, she gets $1 - \pi_w$.
Choosing $WC$ gets $r(1 - \pi_w) + (1 - r) - c_w$. So the indifference
condition implies that

$$r^* = 1 - \frac{c_w}{\pi_w}.$$

Now we need only check that $S$ prefers $WC$. If she plays $\neg WC$
she gets $1 - \pi_s$ whereas in equilibrium she gets
$r(1 - \pi_s) + (1 - r) - c_s$. Thus,

$$EU_S(WC) - EU_S(\neg WC) = \pi_s \frac{c_w}{\pi_w} - c_s > 0$$

by assumption.

3 The authors have revealed their type.

We know from above that the separating equilibrium only exists when
$\frac{c_s}{\pi_s} < 1 < \frac{c_w}{\pi_w}$ while the semi-pooling
equilibrium that we have just considered exists when
$\frac{c_s}{\pi_s} < \frac{c_w}{\pi_w}$. Thus, whenever the separating
equilibrium exists, the semi-pooling equilibrium must also exist.
However, the partial pooling equilibrium exists in many circumstances
(i.e. $\frac{c_w}{\pi_w} < 1$) where the separating equilibrium does
not.
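The mixing probabilities in the semi-pooling equilibrium follow from two indifference conditions, which are easy to verify numerically. The minimal Python sketch below assumes the payoffs used in the proof; the sample parameter values are hypothetical, and the range check in `are_probabilities` is our own addition, reflecting only that $q$ and $r$ must lie between 0 and 1 to be mixing probabilities.

```python
def semi_pooling(p0, pi_s, pi_w, k, c_w):
    """Mixing probabilities (q, r) from the two indifference conditions."""
    q = p0 * (k - pi_s) / ((1 - p0) * (pi_w - k))   # makes C indifferent
    r = 1 - c_w / pi_w                              # makes W indifferent
    return q, r


def posterior_strong(p0, q):
    """Pr(S | war chest) when S always builds and W builds with prob. q."""
    return p0 / (p0 + q * (1 - p0))


def payoff_from_war_chest(r, pi_type, cost):
    """Incumbent's payoff from a war chest when C enters with prob. r."""
    return r * (1 - pi_type) + (1 - r) - cost


def are_probabilities(q, r):
    return 0 <= q <= 1 and 0 <= r <= 1


# Hypothetical parameter values for illustration.
P0, PI_S, PI_W, K, C_S, C_W = 0.3, 0.2, 0.8, 0.5, 0.05, 0.4
q_star, r_star = semi_pooling(P0, PI_S, PI_W, K, C_W)
```

At these values the challenger’s expected chance of winning after a war chest equals $k$, the weak type is indifferent between her two actions, and the strong type strictly prefers the war chest.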


3. Application: Information and Legislative Organization 


One of the longest standing debates about legislative politics
concerns the role of standing committees. Some scholars have focused
on their role in stabilizing majority rule (Shepsle 1978) or
maintaining distributive coalitions (Shepsle and Weingast 1987,
Weingast and Marshall 1988), while others have focused on their role
in promoting the interests of the majority party (Cox and McCubbins
1994). However, Gilligan and Krehbiel (1987) present an alternative
model based on the idea that the role of committees is to specialize
in the development of policy-specific expertise. This quite
influential theory is based on games of incomplete information.

The basis of these models is the idea that policymakers do not
always know the exact link between their policy choices and the
ultimate policy outcomes. For example, legislators may not know how a
specific agricultural policy will affect farmers’ incomes because of a
number of unforeseeable intervening variables such as the weather and
competition from foreign producers. Since legislators would presumably
value such information, they should desire institutional arrangements
that facilitate its gathering and transmission. Gilligan and Krehbiel
argue that a committee system with some limited parliamentary rights
will help solve these informational problems.

To capture the distinction between policy choices and outcomes,
Gilligan and Krehbiel assume that the policy outcome $x$ is an
additive function of policy $p$ and random term $\omega$ so that

$$x = p + \omega.$$


Their game is played between two players: the floor $F$ and the
committee $C$. In our version of the model, we assume that $C$ knows
$\omega$ with certainty but that $F$’s prior beliefs are that:

$$\omega = \begin{cases} \theta & \text{with prob. } .5 \\ -\theta & \text{with prob. } .5 \end{cases}$$

They assume that each has quadratic spatial preferences over a single
dimension and that $F$ has an ideal point of 0 and $C$ has an ideal
point $c > 0$. Thus, the payoffs are $u_F(x) = -x^2$ and
$u_C(x) = -(c - x)^2$ respectively. However, since $F$ does not know
$\omega$, his utility from a given policy $p$ is given by the following
expected utility:

$$.5 u_F(p + \theta) + .5 u_F(p - \theta) = -.5[p^2 + 2p\theta + \theta^2] - .5[p^2 - 2p\theta + \theta^2] = -p^2 - \theta^2$$

Thus, $F$’s utility has two components. The first is $-p^2$ which
reflects the difference between his ideal point and the expected
policy while the second is $-\theta^2$ which reflects the variance in
the policy shock $\omega$. Thus, $F$ is risk averse in that he would
be willing to “pay” to obtain information about $\omega$. Our concern
here is whether $F$ may be willing to grant extra parliamentary rights
to $C$ to provide it with the incentive to specialize and provide
information. Formally, $F$ chooses the procedures or rules under which
legislation proposed by $C$ can be considered. To simplify things, we
assume that $F$ chooses between the following types of rules:


(1) Open rule: The committee reports a bill and the floor may 
freely amend it. 

(2) Closed Rule: $F$ must vote up or down on $C$’s proposal
against the status quo, $SQ = 0$.


The closed rule obviously represents more extensive parliamentary
rights for $C$ since she can make a take-it-or-leave-it offer to $F$.
To solve this as a game of incomplete information, we need to specify
strategies of both “types” of committees $(-\theta, \theta)$, beliefs
of the floor following any proposal, and the floor’s best response
given these beliefs. Following Gilligan and Krehbiel, we will look for
the most “informative” equilibrium. In our context, we will mainly be
interested in whether or not a separating equilibrium exists.4

3.0.4. Open Rule. Under the open rule, the committee must worry
about whether the information it provides can be used to “roll” it on
the floor. Suppose that the committee revealed all of its information
by proposing its ideal policy for each outcome. This means that the
committee chooses $p^c$ so that $x = c$, or $p^c = c - \theta$ when
$\omega = \theta$ and $p^c = c + \theta$ when $\omega = -\theta$. Since
there are distinct proposals for each state of the world, these
proposal strategies fully reveal all information to the floor. Thus,
under an open rule, the floor will amend until $x = 0$, or
$p^f = -\theta$ when $\omega = \theta$ and $p^f = \theta$ when
$\omega = -\theta$. Therefore, utilities for this equilibrium are
$u_F = 0$ and $u_C = -c^2$. Since we are interested in determining
whether or not a separating equilibrium exists, we will specify the
beliefs most unfavorable to $C$ following an out-of-equilibrium
proposal. Thus, we assume that $F$ treats any other bill as if it came
from the “high” type, $\omega = \theta$, and adopts $p^f = -\theta$.
Given $F$’s best responses, it is easy to see that there are only
three outcomes that $C$ can generate: $-2\theta$, $0$, and $2\theta$.

4 The reader who is paying attention should respond “aha! but haven’t
you warned us that informative semi-pooling equilibria can sometimes
exist when separating equilibria do not!” The attentive reader should
consult exercise 8.6 and verify for herself that this is not one of
those cases.

Now we can check to see if $C$ would like to defect from this
separating equilibrium. If she defects by proposing $c + \theta$ when
$\omega = \theta$, $F$ will pass $p^f = \theta$. Since the resulting
outcome is $2\theta$, $C$’s utility from the defection is
$-(c - 2\theta)^2$. Alternatively, by proposing $c - \theta$ when
$\omega = -\theta$, she induces $p^f = -\theta$, leading to
$x = -2\theta$ and $u_C = -(c + 2\theta)^2$. Finally, consider
$p^c \notin \{c + \theta, c - \theta\}$. Since all of these proposals
generate $p^f = -\theta$, $u_C(\theta) = -c^2$ and
$u_C(-\theta) = -(c + 2\theta)^2$.

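Since $c > 0$ and $\theta > 0$, the highest utility defections are
given in Table 8.1.


Table 8.1: Possible Defections

C’s utility            Condition
$-(c - 2\theta)^2$     if $\omega = \theta$ and $c > \theta$
$-c^2$                 if $\omega = \theta$ and $c < \theta$
$-(c + 2\theta)^2$     if $\omega = -\theta$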


Clearly the “low” type won’t defect, since
$-c^2 > -(c + 2\theta)^2$. Similarly if $c < \theta$, the “high” type
will not defect since her equilibrium utility is the same as that of
her most profitable defection. However, when $c > \theta$ and
$\omega = \theta$, the high type prefers the utility of
$-(c - 2\theta)^2$ to her equilibrium payoff. Therefore, the
separating equilibrium does not exist whenever $c > \theta$. Absent a
separating equilibrium, the only equilibrium is an uninformative
pooling equilibrium where both types use the same mixed strategy
across the set of proposals.

Thus, when the committee is an “outlier”, i.e. $c > \theta$, no
information can be revealed. In this case, the high type wants to
convince the floor that $\omega = -\theta$ since it prefers policy
$\theta$ to $-\theta$.
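The open rule argument compares the committee’s equilibrium payoff with its best defection in each state. A minimal Python sketch of that comparison follows, assuming the unfavorable off-path beliefs described above (any unexpected bill is treated as coming from the high type); the function name is our own.

```python
def open_rule_separating_exists(c, theta):
    """True if neither committee type gains by defecting under open rule."""
    equilibrium = -c ** 2    # the floor amends to x = 0 in both states

    # High type (omega = theta): mimic the low type's bill and get
    # outcome 2*theta, or send any other bill and get outcome 0.
    high_defection = max(-(c - 2 * theta) ** 2, -c ** 2)

    # Low type (omega = -theta): every defection yields outcome -2*theta.
    low_defection = -(c + 2 * theta) ** 2

    return high_defection <= equilibrium and low_defection <= equilibrium
```

For $\theta = 1$, the check passes at $c = 0.5$ and fails at $c = 1.5$, matching the cutoff at $c = \theta$.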


3.0.5. Closed Rule. Under the open rule, a major reason why
information cannot be revealed by the “outlier” committee is that the
floor will use this information to “roll” the committee and move
policy to the floor median. However, under the closed rule, the
committee cannot be “rolled”. The floor must accept the proposal or
reject it in favor of the status quo. So now suppose there is a
separating equilibrium where $F$ learns the value of $\omega$. The
utility of the status quo is $-\omega^2$ and of any other proposal
$-(p + \omega)^2$. This implies that $F$ will accept any proposal in
the interval between 0 and $-2\omega$. Thus, when $\omega = \theta$,
the largest policy $F$ will accept is 0. This results in an outcome of
$\theta$. When $\omega = -\theta$, the largest policy $F$ will accept
is $2\theta$. This also results in an outcome of $\theta$.

So the committee can guarantee an outcome as high as $\theta$ by
proposing $p^c = 0$ when $\omega = \theta$ and $p^c = 2\theta$
otherwise. If $c < \theta$, an outcome of $c$ can be guaranteed by
proposing $p^c = c - \theta$ when $\omega = \theta$ and
$p^c = c + \theta$ when $\omega = -\theta$.

To complete the specification of the perfect Bayesian equilibrium,
we need to specify $F$’s behavior and beliefs following any
out-of-equilibrium proposals from $C$. To support all possible
separating equilibria, it is sufficient to have $F$ accept any
proposal such that $-2\theta \leq p^c \leq 0$ and vote down any
out-of-equilibrium proposal with $p^c > 0$ or $p^c < -2\theta$. This
strategy is a best response to the beliefs
$\Pr\{\omega = \theta \mid p^c\} = 1$ following an out-of-equilibrium
proposal.

Now we check that $C$ prefers its equilibrium strategy to any
defection. When $c < \theta$, $C$ gets its ideal point so she cannot
do better by defecting. Thus, a separating equilibrium exists, just as
it did for the open rule. So we need only focus on the case where
$c > \theta$. First suppose that $\omega = -\theta$; then $C$ gets
$-(c - \theta)^2$ for $p^c = 2\theta$ and $-(c + \theta)^2$ for any
other bill. $C$ will not defect in this case. Now suppose that
$\omega = \theta$; then $C$ receives $-(c - \theta)^2$ for $p^c = 0$,
$-(c - 3\theta)^2$ for $p^c = 2\theta$, and $-(c - \theta)^2$ for any
other bill. Thus, so long as $c < 2\theta$, $C$ weakly prefers her
equilibrium proposal $p^c = 0$. However, if $c > 2\theta$, the
committee will defect so that no separating equilibrium exists. In
this case, the only equilibria are uninformative ones such as those
where both types of committee use the same mixed strategy over all
proposals and the floor rejects all proposals except $p^c = 0$.

Since the closed rule can sustain a separating equilibrium in cases
where the open rule cannot, i.e. $2\theta > c > \theta$, the model
predicts that restrictive rules might be employed to encourage greater
information transmission from the committee to the floor.
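The closed rule case can be checked in the same way by comparing the equilibrium payoff with the defection payoffs listed above. This is a minimal Python sketch under the stated off-path beliefs; the function name is our own, and the boundary cases $c = \theta$ and $c = 2\theta$ are treated as weak preferences.

```python
def closed_rule_separating_exists(c, theta):
    """True if neither committee type gains by defecting under closed rule."""
    if c <= theta:
        return True    # the committee already obtains its ideal point

    # For c > theta the equilibrium outcome is theta in both states.
    equilibrium = -(c - theta) ** 2

    # High type (omega = theta): proposing 2*theta yields outcome 3*theta;
    # any other bill leaves the outcome at theta.
    high_defection = max(-(c - 3 * theta) ** 2, -(c - theta) ** 2)

    # Low type (omega = -theta): any other bill yields outcome -theta.
    low_defection = -(c + theta) ** 2

    return high_defection <= equilibrium and low_defection <= equilibrium
```

With $\theta = 1$, separation survives at $c = 1.5$, where it failed under the open rule, but breaks down at $c = 2.5$.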


3.1. Committee Specialization. In the preceding section, we
showed that our version of the Gilligan-Krehbiel model predicts that
restrictive rules can encourage more information transmission from
informed committees than do open rules. We now turn to an analysis of
whether a commitment to restrictive rules by the floor can induce the
committee to specialize in the first place. So now the game takes the
following form:


(1) $F$ chooses whether the committee will report the bill
under open or closed rule.

(2) $C$ decides whether to specialize by paying a cost $k$ to learn
$\omega$.

(3) $C$ proposes $p^c$.

(4) Under closed rule, $F$ votes $p^c$ up or down against $SQ$. Under
open rule, $F$ chooses $p^f$.


If the committee does not specialize, $F$ will decide based on its
prior beliefs. Therefore, under the closed rule, it will veto any
proposal other than $p^c = 0$. Under the open rule, $F$ passes
$p^f = 0$. Thus, non-specialization results in a policy of $p = 0$ for
both rules.

Now consider the committee’s specialization decision under open
rule. If the committee specializes when $c < \theta$, $C$ and $F$ play
the open rule separating equilibrium resulting in an overall payoff of
$-c^2 - k$ for the committee. If the committee does not specialize,
then its expected utility from $p = 0$ is $-c^2 - \theta^2$. So the
committee specializes so long as $k < \theta^2$. If $c > \theta$ and
the committee specializes, the committee and floor will play the
pooling equilibrium which leads to $p = 0$ and an overall committee
payoff of $-c^2 - \theta^2 - k$. Thus, the committee will obviously
not specialize under these circumstances.

Now consider the closed rule. If $c < \theta$, the separating
equilibrium generates an outcome of $c$ so that the committee’s
utility of specializing is $-k$. Similarly, if $c > \theta$, the
utility of specializing is $-(c - \theta)^2 - k$. In both cases,
non-specialization leads to a payoff of $-c^2 - \theta^2$. Comparing
these utilities, we find that it pays for the committee to specialize
when $k < 2\theta c$ and $c > \theta$ and when $k < c^2 + \theta^2$
and $c < \theta$. Since both of these critical values for $k$ are
higher than $\theta^2$, the committee often specializes under closed
rule when it would not have under open rule.
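The specialization thresholds come from comparing the committee’s informed and uninformed payoffs under each rule. The following minimal Python sketch encodes the payoffs exactly as stated in the text (it does not separately test whether the closed rule separating equilibrium exists for very large $c$); the function name and the string labels for the rules are our own.

```python
def committee_specializes(rule, c, theta, k):
    """True if paying k to learn omega raises the committee's payoff."""
    uninformed = -c ** 2 - theta ** 2    # policy p = 0 under either rule

    if rule == "open":
        if c < theta:
            informed = -c ** 2 - k                 # separating: outcome 0
        else:
            informed = -c ** 2 - theta ** 2 - k    # pooling: p = 0
    else:  # closed rule
        if c < theta:
            informed = -k                          # outcome c
        else:
            informed = -(c - theta) ** 2 - k       # outcome theta

    return informed > uninformed
```

With $\theta = 1$ and $c = 0.5$, a cost of $k = 1.1$ is too high under the open rule (the cutoff is $\theta^2 = 1$) but acceptable under the closed rule (the cutoff is $c^2 + \theta^2 = 1.25$).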


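3.2. Implications. Gilligan and Krehbiel draw several implications
about institutional design by computing the floor’s utility under the
different rules. The floor’s expected payoffs under open rule are

$$E[u_F] = 0 \quad \text{if } c < \theta \text{ and } k < \theta^2$$

$$E[u_F] = -\theta^2 \quad \text{if } c > \theta \text{ or } k > \theta^2$$

while the payoffs under closed rule are

$$E[u_F] = -c^2 \quad \text{if } c < \theta \text{ and } k < c^2 + \theta^2$$

$$E[u_F] = -\theta^2 \quad \text{if } c > \theta \text{ or } k > c^2 + \theta^2$$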

There are a couple of implications worth noting. First, $F$ is always
better off when $C$’s ideal point is close to the floor’s ideal point
0. This is the basis of Krehbiel’s argument that majoritarian
legislatures should attempt to appoint committees that are
representative of the preferences of the chamber. He contrasts this
implication with that of the distributive theory of legislatures which
predicts that committees will be composed of high demanders for the
policies in the committee’s jurisdiction.

Secondly, the model makes predictions about when $F$ will prefer a
closed rule. Specifically, $F$ will choose a closed rule when
$c < \theta$ and $\theta^2 < k < c^2 + \theta^2$. Thus, the committees
most likely to receive closed rules are those whose ideal points do
not diverge far from the floor’s and those with intermediate
specialization costs.
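The floor’s choice of rule can be read off by comparing its expected payoffs under the two procedures. Below is a minimal Python sketch based on the payoffs listed above; the function names are our own, and ties are resolved in favor of the open rule purely as an assumption.

```python
def floor_payoff(rule, c, theta, k):
    """Floor's expected payoff, given the committee's specialization choice."""
    if rule == "open":
        informative = c < theta and k < theta ** 2
        return 0.0 if informative else -theta ** 2
    # Closed rule: an informed committee with c < theta obtains outcome c;
    # in every other case the floor's payoff is -theta^2.
    informative = c < theta and k < c ** 2 + theta ** 2
    return -c ** 2 if informative else -theta ** 2


def floor_prefers_closed(c, theta, k):
    """True if the closed rule gives the floor a strictly higher payoff."""
    return floor_payoff("closed", c, theta, k) > floor_payoff("open", c, theta, k)
```

For $\theta = 1$ and $c = 0.5$, the floor prefers the closed rule at $k = 1.1$ but not at $k = 0.5$ (the open rule already induces specialization) or at $k = 2$ (neither rule does).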


4. Application: Informational Lobbying 


The traditional literature on lobbying in legislatures has uncovered 
a striking empirical regularity: interest groups almost always lobby 
their friends. Since these friends are likely to vote with the inter- 
est group anyway, this observation has been interpreted to mean that 
lobbying activities are not very consequential. 

However, Austen-Smith and Wright (1992, 1994) develop a model in
which groups do indeed lobby their friends, but in which their efforts
are important. The main premise of their model is that interest groups
have private information about the consequences of a legislative
decision that are unknown to the legislator. “Lobbying” consists of a
group making a speech to the legislator. Since lobbying is assumed to
be costly, groups will choose whether or not to lobby the legislator.
The main result of their paper is that groups will often lobby
friendly legislators to counteract the lobbying efforts of opposing
groups.

Before analyzing the full model, we begin with a model with only
one interest group and a legislator. The legislator L is required to choose
between two policies A and B. However, she is uncertain as to which
policy she prefers. Assume that there are two states s = A or B
such that the legislator prefers policy A in state A and B in state
B. To keep things simple, we assume that $u_L(A) = 1$ in state A
and $u_L(A) = -1$ in state B. We assume $u_L(B) = 0$ in both states.
The legislator believes that s = A with probability $p < \frac{1}{2}$. Thus, in
the absence of any additional information provided by lobbying, the
legislator will choose B.
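
The comparison behind this default choice can be written out directly from the payoffs assumed above; nothing beyond those payoffs is used:

```latex
\begin{align*}
E[u_L(A)] &= p \cdot 1 + (1-p)\cdot(-1) = 2p - 1, \\
E[u_L(B)] &= 0 .
\end{align*}
```

Since $p < \frac{1}{2}$ implies $2p - 1 < 0$, an uninformed legislator strictly prefers B.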

Suppose that there is an interest group $G_A$ that prefers A to B.
We assume that $u_{G_A}(A) = 1$ and $u_{G_A}(B) = -1$ in both states. If $G_A$
decides to lobby, it pays cost c > 0 to learn the true state with certainty.
Its ex ante beliefs about the state are the same as L’s. Once informed,
the group sends one of two messages m = A or B where the messages
are to be interpreted literally.

After observing message m, L may attempt to verify the group’s
information by “auditing” the message. In doing so, L incurs cost
$\kappa$. If the message is found to be incorrect, i.e. $m \neq s$, the group is
penalized by an amount $\delta$.

Now consider the group’s possible lobbying strategies. First, sup-
pose $G_A$ always reports A independent of s. Then the message would
be uninformative, and L would always vote B if she does not audit and
would choose the optimal outcome following an audit. Since the expected
utility of an audit is $p - \kappa$, it is easy to show that L’s best response to an
“always A” strategy involves auditing if and only if $p > \kappa$.

Since an “always A” lobbying strategy does not affect the outcome,
$G_A$ would prefer not to lobby at all. Therefore, successful lobbying
requires that $G_A$ tell the truth at least some of the time. Suppose
$G_A$ told the truth all of the time. Then L would always follow such
advice. However, this would give $G_A$ the incentive to report A even
when s = B. Since $G_A$ has an incentive to deviate, this cannot be an
equilibrium.

Thus, any perfect Bayesian equilibrium to this game has to be semi-
pooling. So consider an equilibrium where $G_A$ always reports A when
s = A and reports A with probability $\mu$ when s = B. Such a strategy
leads L to update (using Bayes’ Rule) her belief that s = A to

$$\hat{p} = \frac{p}{p+\mu(1-p)}.$$

In such an equilibrium, L must be indifferent between voting for A and
auditing. The expected utility of voting A is $2\hat{p}-1$ whereas the utility
of auditing is $\hat{p}-\kappa$. So the group will choose $\mu$ so that

$$\frac{p}{p+\mu(1-p)} = 1-\kappa \quad\text{or}\quad \mu = \frac{p\kappa}{(1-p)(1-\kappa)}.$$
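
For readers who want the algebra, the mixing probability follows in two steps. Here $\hat{p}$ is our shorthand for L’s posterior that s = A after hearing message A, $\kappa$ is the audit cost, and $\mu$ is the probability that the group reports A when s = B:

```latex
\begin{align*}
2\hat{p} - 1 = \hat{p} - \kappa
  &\;\Longrightarrow\; \hat{p} = 1 - \kappa, \\
\frac{p}{p + \mu(1-p)} = 1 - \kappa
  &\;\Longrightarrow\; p\kappa = \mu(1-p)(1-\kappa)
  \;\Longrightarrow\; \mu = \frac{p\kappa}{(1-p)(1-\kappa)} .
\end{align*}
```

The first line is L’s indifference between voting A and auditing; the second substitutes the Bayesian posterior and solves for $\mu$.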


To close the model, L must choose a probability of auditing $\alpha$ so that
$G_A$ is indifferent between lying and being truthful when s = B. If
$G_A$ is truthful, it gets $-1$. If $G_A$ lies, it gets $-\alpha(\delta+1)+(1-\alpha)\cdot 1$.
Therefore, L sets
$$\alpha = \frac{2}{2+\delta}.$$
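
The auditing probability can be recovered from the group’s indifference condition in state B. The sketch below uses only the payoffs stated above ($-1$ from a truthful report, $-(1+\delta)$ if a lie is audited, and $1$ if a lie goes unaudited), with $\alpha$ the audit probability and $\delta$ the penalty:

```latex
\begin{align*}
-1 &= -\alpha(\delta + 1) + (1-\alpha)\cdot 1 \\
-2 &= -\alpha(\delta + 2) \\
\alpha &= \frac{2}{2+\delta} .
\end{align*}
```

A larger penalty therefore lets L audit less often while still keeping the group indifferent.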
Given these lobbying and auditing strategies, we can compute $G_A$’s
expected utility to determine whether or not it will choose to become
informed and lobby. Since L always chooses B in the absence of
lobbying, the group’s utility from not lobbying is $-1$. If it does lobby,
the group gets A for sure when s = A and an expected utility of $-1$
when s = B. Thus, the ex ante expected payoff from lobbying is
$2p-1-c$. Hence, the group lobbies if and only if $p > \frac{c}{2}$.

That this equilibrium requires $0 < \mu < 1$ implies that we need
$1-p > \kappa > 0$. Thus, it cannot be too costly for the legislator to
audit. If it is too costly, the legislator will not audit, which guarantees
that the group has the incentive to always lie. However, if the group
always lies, lobbying will be ineffective and the group will choose not
to become informed.
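
The one-group equilibrium can be checked numerically. The following is a minimal sketch rather than part of the original model: the function name, the dictionary keys, and the parameter values are our own illustrative choices, and exact fractions are used so that the indifference conditions hold exactly.

```python
from fractions import Fraction


def one_group_equilibrium(p, kappa, delta, c):
    """Semi-pooling quantities for the one-group model (illustrative helper).

    p     -- prior probability that s = A (the text assumes p < 1/2)
    kappa -- L's cost of auditing
    delta -- penalty imposed on the group when an audit exposes a lie
    c     -- the group's cost of becoming informed
    """
    if not 0 < kappa < 1 - p:
        raise ValueError("semi-pooling needs 0 < kappa < 1 - p")
    mu = p * kappa / ((1 - p) * (1 - kappa))  # Pr(report A | s = B)
    alpha = Fraction(2) / (2 + delta)         # Pr(audit | message A)
    posterior = p / (p + mu * (1 - p))        # Pr(s = A | message A)
    return {
        "mu": mu,
        "alpha": alpha,
        "posterior": posterior,
        # L is indifferent between voting A and auditing at this posterior.
        "vote_A": 2 * posterior - 1,
        "audit": posterior - kappa,
        # G_A's payoff from lying in state B; equals the truthful payoff of -1.
        "lie_in_B": -alpha * (delta + 1) + (1 - alpha),
        # Lobbying beats the no-lobbying payoff of -1 exactly when p > c/2.
        "lobbies": 2 * p - 1 - c > -1,
    }


eq = one_group_equilibrium(
    p=Fraction(1, 3), kappa=Fraction(1, 4), delta=Fraction(2), c=Fraction(1, 10)
)
print(eq)
```

With these hypothetical values the posterior after message A equals $1-\kappa$, and both indifference conditions hold with equality, as the derivation requires.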

Now consider the model with two groups. Suppose now that there
is a group $G_B$ with preferences opposed to $G_A$ so that $u_{G_B}(B) = 1$ and
$u_{G_B}(A) = -1$. It also may learn the true state by paying a cost c.
We assume that both groups decide whether to become informed and
choose their messages, $m_A$ and $m_B$, simultaneously.

First, note that since L chooses B in the absence of lobbying there
can be no equilibrium where $G_B$ lobbies but $G_A$ does not. If $G_B$ lobbied
alone, it would incur cost c without altering the outcome. Austen-
Smith and Wright interpret this result as implying that groups only
lobby “friendly” legislators to counteract the lobbying of other groups.

Since the outcome of $G_A$ lobbying alone is outlined above, we need
only focus on the outcome when both groups lobby. Consider the
following equilibrium. Both groups send truthful messages. When
$m_A = m_B = s$, the legislator believes that the true state is s. However,
if $m_A \neq m_B$, L audits the message and chooses her optimal policy. For
auditing to be a best response to the out-of-equilibrium messages, we
specify that $p > \kappa$ if $m_A \neq m_B$.

Now we check that $G_A$ will not deviate and choose an untruthful
message. Clearly, it has no incentive to choose $m_A = B$ when s = A.
So consider whether it will choose $m_A = A$ when s = B. Given $G_B$’s
strategy, this results in a posterior of p. Since this message will lead
to an audit, it results in a policy of B and a penalty of $\delta$ for a total
payoff of $-1-\delta$. Since telling the truth leads to a payoff of $-1$, $G_A$
has no incentive to deviate.

Turning to $G_B$’s decision, we need to check that it will not choose
$m_B = B$ when s = A. As before, such a message leads to an audit
and a penalty for $G_B$. Thus, it will not defect.

Now we must verify the conditions under which both groups would
prefer to lobby. Since lobbying by both groups leads to the full infor-
mation outcome, the equilibrium utilities of $G_A$ and $G_B$ are $2p-1-c$
and $1-2p-c$ respectively. If only $G_A$ lobbies, its utility is $2p-1-c$,
so its utility from lobbying is independent of $G_B$’s choice. However,
note that $G_B$’s utility from having $G_A$ lobby alone is

$$-p+(1-p)\left[\mu(2\alpha-1)+(1-\mu)\right] = 1-2p-\frac{\kappa p}{1-\kappa}\cdot\frac{2\delta}{\delta+2}.$$

Therefore, $G_B$ will only decide to lobby if $p > \frac{(1-\kappa)(\delta+2)}{2\kappa\delta}c$.
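
The payoff comparison for the second group can also be traced numerically. The sketch below is an illustration under stated assumptions: the function names and parameter values are hypothetical, the semi-pooling values of the lying and auditing probabilities from the one-group analysis are taken as given, and the out-of-equilibrium requirement that auditing be a best response is not checked.

```python
from fractions import Fraction


def gb_payoff_when_ga_lobbies_alone(p, kappa, delta):
    """G_B's expected payoff when only G_A lobbies (semi-pooling play)."""
    mu = p * kappa / ((1 - p) * (1 - kappa))  # Pr(G_A reports A | s = B)
    alpha = Fraction(2) / (2 + delta)         # Pr(L audits | message A)
    # State A (prob. p): policy A is chosen, so G_B gets -1.
    # State B: a lie (prob. mu) is audited with prob. alpha (policy B, +1)
    # and otherwise succeeds (policy A, -1); a truthful report yields B (+1).
    return -p + (1 - p) * (mu * (2 * alpha - 1) + (1 - mu))


def lobbying_regime(p, kappa, delta, c):
    """Classify who lobbies, following the payoff comparisons in the text."""
    if 2 * p - 1 - c <= -1:  # G_A gains nothing from lobbying (p <= c/2)
        return "neither"
    if 1 - 2 * p - c > gb_payoff_when_ga_lobbies_alone(p, kappa, delta):
        return "both"
    return "only G_A"


p, kappa, delta = Fraction(1, 3), Fraction(1, 4), Fraction(2)
for c in (Fraction(1, 10), Fraction(1, 5), Fraction(1)):
    print(c, lobbying_regime(p, kappa, delta, c))
```

Holding the prior fixed, raising the information cost c moves the outcome from both groups lobbying, to only the group opposed to the legislator’s ex ante position lobbying, to no lobbying at all.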


Thus, we can characterize the equilibrium lobbying decisions of both
groups as follows. If $p < \frac{c}{2}$, neither group lobbies. If
$\frac{(1-\kappa)(\delta+2)}{2\kappa\delta}c > p > \frac{c}{2}$, only $G_A$ lobbies. If
$p > \frac{(1-\kappa)(\delta+2)}{2\kappa\delta}c$, both groups lobby.

Austen-Smith and Wright interpret this perfect Bayesian equilibrium
as implying the following hypotheses.


• Ceteris paribus, when a legislator is lobbied by groups from
just one side of an issue, the only groups that lobby are those
opposed to the legislator’s ex ante position.

• The decision of a group to lobby an “unfriendly” legislator is
independent of the lobbying decisions of opposing groups.

• Conditional on a “friendly” legislator being lobbied by an op-
posing group, a group’s decision to lobby that legislator is
purely counteractive.


5. Refinements of Perfect Bayesian Equilibrium* 


5.1. Sequential Equilibria. By now it should be clear that weak
consistency lives up to its name by not being a strong enough constraint
on off-the-path actions. In fact, a PBE need not even be subgame perfect,
a fact demonstrated by the example illustrated in Figure 8.10.

Insert Figure 8.10 here 

In this extensive form game, C is a potential candidate for office,
and must decide whether to run or not. After this decision, a popular
media source m such as the local newspaper decides whether to endorse
C or not. Following this decision, but without knowing the media’s
decision, the incumbent I must decide how much campaign effort to
exert.⁵ The payoffs are chosen to reflect the fact that C prefers to
enter only if he is endorsed and the incumbent chooses low effort, the
media prefers to endorse C only if the incumbent chooses low effort,
and the incumbent prefers to exert effort only when C is not endorsed.

In Figure 8.10, the shaded branches denote a strategy profile and 
the numbers p = 0 and p = 1 denote beliefs. It is not difficult to see 
that the figure depicts a PBE. Given the belief that “not endorse” 
occurs with probability 1, the incumbent optimally selects high effort. 


⁵We can also interpret the endorsement and effort decisions as occurring
simultaneously.
Moreover, given the expectation that endorsement and high effort will
follow a decision to run, the optimal decision for C is to not run for
office. Given this strategy profile, the incumbent’s information set is not
reached and thus weak consistency does not restrict beliefs. Despite
the fact that this is a PBE, the specified strategy profile is not even
subgame perfect. To see this, consider the circled subgame that starts
with m’s decision. Given that m is choosing endorse, high effort is not
a best response. The pathology exhibited by this game is that we can
write down strategy profiles for which there are non-trivial information
sets that are several moves away from the equilibrium path.

In a reasonably defined equilibrium, we would like I’s beliefs about
m’s decision (conditional on reaching this information set) to be some-
how consistent with player m’s strategy. Accordingly, we might con-
jecture that (i) if C runs then either m endorses and I exerts low effort
or (ii) m does not endorse and I exerts high effort or (iii) both m and
I randomize. As we shall see, combining sequential rationality and a
stronger notion of consistency can avoid this pathology.

In this section we limit ourselves to finite games and present several 
stricter equilibrium concepts. All of the concepts defined will involve 
sequential rationality as defined in Definition 8.2. Where these con- 
cepts differ is in the restrictions imposed on beliefs. 

First we need a bit of notation. Given a finite game, a mixed
strategy profile $\sigma(\cdot)$ is a mapping that determines a lottery over avail-
able actions at each information set. For a finite game such a strategy
profile can be written as a vector $(\sigma(1,1), \ldots, \sigma(a,I), \ldots)$ with generic
coordinate $\sigma(a,I)$ denoting the probability that action a is played at
information set I. A completely mixed strategy profile selects every
action at every information set with positive probability. A pure strat-
egy is then a vector containing only 0’s and 1’s. A sequence of mixed
strategies $\{\sigma(\cdot)^n\}_{n=1}^{\infty}$ is said to converge to a mixed strategy $\sigma(\cdot)$ if for
each I, a, $\sigma(a,I)^n$ converges to $\sigma(a,I)$. The notion of sequential equi-
librium replaces weak consistency with a stronger condition.


DEFINITION 8.6. Given a finite extensive form game $\Gamma$ with imper-
fectly observed actions, a sequential equilibrium (SE) is a pair
$(\sigma(\cdot), b(\cdot))$ such that: (1) the strategy profile $\sigma(\cdot)$ is sequentially ra-
tional relative to the belief $b(\cdot)$, and (2) there exists some sequence of
completely mixed strategies $\{\sigma(\cdot)^n\}_{n=1}^{\infty}$ and beliefs $\{b(\cdot)^n\}_{n=1}^{\infty}$ that con-
verge to $\sigma(\cdot)$, $b(\cdot)$ respectively and for some $n'$, if $n > n'$ then $b(\cdot)^n$ is weakly
consistent relative to the strategy profile $\sigma(\cdot)^n$.


To demonstrate how this concept refines our notion of PBE, we first
return to the game in Figure 8.10. Consider any completely mixed
profile $\sigma^n(\cdot)$ where we will assume that every pure strategy must be
played with probability at least $\epsilon_n$, where $\epsilon_n \to 0$ as $n \to \infty$. If this sequence
of mixed strategies is going to converge to the PBE in Figure 8.10, we
require $\sigma^n(\text{endorse}, \text{run})$ to be a mixed best response to $\sigma^n(\text{run}) = \epsilon_n$
and $\sigma^n(\text{high effort}, \text{run}) = 1-\epsilon_n$. However, m’s expected utility
from endorse is therefore $\epsilon_n$ while its payoff to not endorse is $1-2\epsilon_n$.
Therefore, since $\epsilon_n$ converges to 0, there must be some N such that for
all n > N, m prefers to choose $\sigma^n(\text{endorse}, \text{run}) = \epsilon_n \to 0$. Thus, the
equilibrium in Figure 8.10 is not the limit of these completely mixed
equilibria and is not a sequential equilibrium.

We can also see the restrictions that sequential equilibria place on
beliefs. Weak consistency requires that the beliefs about the media’s
decision correspond to the strategy, $b^n(\text{endorse}) = \sigma^n(\text{endorse}, \text{run})$.
Accordingly, for a sequence of completely mixed strategies that con-
verges to a profile playing not run and endorse if run with probability
1, $b(\cdot)^n$ must converge to beliefs that put probability 1 on endorse.
Therefore, there are no sequences of completely mixed strategies and
weakly consistent beliefs that converge to the strategies and beliefs de-
picted in Figure 8.10. In fact, it can be shown that every sequential
equilibrium involves strategies that are subgame perfect. The proof of
this result is left as an exercise.
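
The way weak consistency ties beliefs to strategies along a completely mixed sequence can be illustrated with a few lines of code. This is a sketch under our own naming (the function and argument names are not from the text); it simply applies Bayes’ Rule at the incumbent’s information set.

```python
from fractions import Fraction


def incumbent_belief(sigma_run, sigma_endorse):
    """Bayes' Rule belief that m endorsed, given I's information set is reached.

    sigma_run     -- probability that C runs
    sigma_endorse -- probability that m endorses after C runs
    Both must be strictly between 0 and 1 (a completely mixed profile).
    """
    reach_after_endorse = sigma_run * sigma_endorse
    reach_after_no_endorse = sigma_run * (1 - sigma_endorse)
    return reach_after_endorse / (reach_after_endorse + reach_after_no_endorse)


# A sequence converging to "not run" and "endorse if run".
for n in (10, 100, 1000):
    eps = Fraction(1, n)
    print(n, incumbent_belief(sigma_run=eps, sigma_endorse=1 - eps))
```

The probability of running cancels out of the ratio, so the belief always equals the probability of endorsement. However small the probability of entry becomes, the limiting belief must therefore agree with the media’s limiting strategy.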

While sequential equilibrium is an improvement over PBE, in many 
applications the two concepts turn out to have equivalent equilibrium 
sets. For example in the classic signaling games considered earlier the 
concepts coincide. Fudenberg and Tirole (1991) have shown that 


PROPOSITION 8.9. (Fudenberg and Tirole 1991) If $\Gamma$ has only 2
periods or every player has at most two types then the sets of PBE and
SE coincide.


5.2. The Intuitive Criterion. While the concept of sequential
equilibrium can sometimes eliminate implausible PBE, it often contin-
ues to permit implausible out-of-equilibrium beliefs. To address these
problems, Cho and Kreps propose the intuitive criterion to further
restrict the equilibrium set in many signaling games.

Since we wish to categorize certain off-the-path beliefs as implausible,
Cho and Kreps postulate that beliefs should be concentrated on those types
of senders who have the greatest incentive to defect. They illustrate
their concept with what has become known as the Beer and Quiche
game. We follow this tradition because this is the simplest possible
game upon which the concept can be demonstrated, but offer some
political embellishments, rephrasing the interaction as a game between
Saddam Hussein and George Bush on the eve of the American invasion
of Iraq. Consider a two-player signaling game in which the sender,
Hussein, has two possible types. He can have weapons of mass destruc-
tion, $\theta = w$, or not, $\theta = {\sim}w$. The receiver, Bush, can either attack, $s_r = a$,
or not, $s_r = {\sim}a$. Bush has no concerns about winning if he attacks, but
needs to justify the attack with the claim that Hussein has weapons
of mass destruction. Accordingly, he prefers to attack if w and he
prefers not to attack if ~w. Prior to Bush’s decision, Hussein decides
whether to allow weapons inspections, $s_s = y$, or not, $s_s = {\sim}y$. However,
the result of a weapons inspection would not be realized before Bush’s
decision of whether to attack or not. We assume that regardless of
Bush’s decision, type w suffers a cost of 1 from inspection as he is shown
to be in violation of UN resolutions. Type ~w receives a benefit of 1
from inspection as he is publicly vindicated. Regardless of Hussein’s
type he would prefer not to be attacked, and in fact the cost of being
attacked is sufficiently large that Hussein of either type is willing to
make either decision regarding UN inspections to avoid attack. Bush,
on the other hand, does not care directly about the inspections, but he
prefers attacking as long as the probability of $\theta = w$ is sufficiently high
(greater than $\frac{1}{2}$). Figure 8.11 depicts the game.


Insert Figure 8.11 


It is not difficult to see that there are pooling equilibria in which
weapons inspectors are allowed ($s_s = y$) and there are pooling equilib-
ria in which they are not allowed ($s_s = {\sim}y$). In any pooling equilibrium,
Bush’s posterior must correspond to the prior. Thus if the prior prob-
ability of w is sufficiently low, Bush will not attack. To support pooling
at $s_s = y$, it is necessary that both Hussein types prefer y to ~y. This
results in the incentive compatibility conditions

$$3 \ge EU_s({\sim}y, {\sim}w)$$
$$2 \ge EU_s({\sim}y, w)$$

Letting $pr(a \mid {\sim}y)$ denote the probability that Bush attacks after ob-
serving ~y, the above conditions require that we have

$$3 \ge 0 \cdot pr(a \mid {\sim}y) + 2(1 - pr(a \mid {\sim}y))$$

$$2 \ge 1 \cdot pr(a \mid {\sim}y) + 3(1 - pr(a \mid {\sim}y)).$$

This is true as long as $pr(a \mid {\sim}y) \ge \frac{1}{2}$. In order for Bush to use a
strategy in which $pr(a \mid {\sim}y) \ge \frac{1}{2}$, his posterior belief about Hussein’s
type conditional on the off-the-path action ~y needs to satisfy
$pr(w \mid {\sim}y) \ge \frac{1}{2}$. This posterior is not pinned down by Bayes’ Rule. Recall
that the on-the-path belief $pr(w \mid y)$ corresponds to the prior and is
pinned down in a pooling equilibrium. So we have shown that there is
a PBE (and by Proposition 8.9 an SE) in which both types of Hussein
pool at y and Bush does not attack.
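
To see where the threshold of one half comes from, write q as our shorthand for the probability that Bush attacks after observing ~y. The two incentive conditions for pooling on y then reduce as follows:

```latex
\begin{align*}
\text{type } {\sim}w:&\quad 3 \ge 0\cdot q + 2(1-q) = 2 - 2q
   \quad\text{(holds for every } q \in [0,1]\text{)}, \\
\text{type } w:&\quad 2 \ge 1\cdot q + 3(1-q) = 3 - 2q
   \;\Longleftrightarrow\; q \ge \tfrac{1}{2} .
\end{align*}
```

Only the condition for the type with weapons binds: he gives up his preferred message only if deviating is sufficiently likely to provoke an attack.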

To support the other pooling equilibrium where both types of Hus-
sein select ~y and Bush again does not attack, we need only specify
off-the-path beliefs $pr(w \mid y) \ge \frac{1}{2}$. Following these beliefs, Bush at-
tacks with probability at least $\frac{1}{2}$ and so both Hussein types prefer to
select ~y and avoid attack.

While they are both PBE and SE, Cho and Kreps argue that only
one of these pooling equilibria is reasonable. Consider the equilibrium
in which both Hussein types select ~y. This equilibrium requires that
the off-the-path belief satisfies $pr(w \mid y) \ge \frac{1}{2}$. Is it reasonable for Bush
to believe that Hussein is more likely to have weapons if he allows
inspections than if he doesn’t? Recall that in this equilibrium type $\theta = w$
is getting his maximal possible payoff. No inspections and no attack
result in a payoff of 3. However, the defection to y and no attack results
in a payoff of 2 for w. Such a deviation is not desirable even under the
assumption that the deviation will not trigger attack. If an attack were
triggered by y then the defection is even less attractive. On the other
hand, consider the potential incentive for a deviation by a type ~w. In
equilibrium he gets payoff 2. However, if his defection did not result
in an attack he would get utility of 3 (and thus improve his situation).
Thus, it seems intuitive that if a defection were observed it would
most likely be committed by ~w. Cho and Kreps argue that Bush would
be foolish to interpret y as suggestive of w. Instead they imagine that
the only justification for such an off-the-path deviation is that a type
~w Hussein might deviate to y and send the following justification.


Dear W: 


Sorry for past squabbles with your old man. About
this recent disagreement, I know that you are expecting
me to choose ~y and this doesn’t tell you anything; it’s a
pooling equilibrium after all (you remember pool tables
from Yale, don’t you?). But I am not going to do this,
because I actually don’t have any weapons and I want
to show this to the world, so I am going to make myself
even better off. Besides avoiding the tanks, special ops,
and media barrage when you decide not to attack, I’m
also going to allow weapons inspectors in to show that
I have been well-behaved. You should trust that this
action indicates that I really have no weapons because if
I did have weapons and I expected you not to attack if I
didn’t allow the inspections (which is the equilibrium we
are playing) then I would only hurt myself by letting in
the inspectors.


Sincerely 


SH 


Of course this type of communication is not modeled in standard
signaling games. The point is that given the Bush strategy, one type
can possibly gain from the off-the-path deviation, while the other type
can only lose. In such a setting, the off-the-path beliefs should be
concentrated on the type that stands to gain. Note that the y pooling
equilibrium is immune from this criticism. The only type that stands to
possibly gain from choosing ~y is w. So the beliefs supporting Bush’s
attack following ~y are reasonable.
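
The informal argument can be checked mechanically. The sketch below is ours, not the authors’: the payoff function is inferred from the inequalities in the text (2 for avoiding attack plus 1 for sending one’s preferred message) rather than read off Figure 8.11, and the function names are hypothetical.

```python
TYPES = ("w", "~w")


def sender_payoff(theta, message, attacked):
    """Hussein's payoff: 2 for avoiding attack plus 1 for his preferred message.

    These numbers are inferred from the inequalities in the text, not read
    off Figure 8.11.
    """
    preferred = "~y" if theta == "w" else "y"
    return (0 if attacked else 2) + (1 if message == preferred else 0)


def types_that_could_gain(pooling_message):
    """Types whose best-case deviation payoff beats the pooling payoff."""
    deviation = "y" if pooling_message == "~y" else "~y"
    gainers = []
    for theta in TYPES:
        equilibrium = sender_payoff(theta, pooling_message, attacked=False)
        # Both of Bush's actions are a best response to some belief.
        best_case = max(sender_payoff(theta, deviation, a) for a in (True, False))
        if best_case > equilibrium:
            gainers.append(theta)
    return gainers


def survives_intuitive_criterion(pooling_message):
    """Check the pooling equilibrium once beliefs sit on the gaining type."""
    deviation = "y" if pooling_message == "~y" else "~y"
    gainers = types_that_could_gain(pooling_message)
    if len(gainers) != 1:
        raise ValueError("sketch only handles a single gaining type")
    attacked = gainers[0] == "w"  # with beliefs on one type, attack iff it is w
    return all(
        sender_payoff(t, deviation, attacked)
        <= sender_payoff(t, pooling_message, False)
        for t in TYPES
    )


for message in ("y", "~y"):
    print(message, types_that_could_gain(message), survives_intuitive_criterion(message))
```

Under these assumed payoffs, only type w could gain by deviating from pooling on y, so attacking after ~y is justified and that equilibrium survives. Only type ~w could gain by deviating from pooling on ~y, so Bush should not attack after y, and type ~w then strictly prefers to deviate.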

We now present the intuitive criterion in a slightly more rigorous
manner and so we require a bit more notation. Let $\Gamma$ denote a simple
signaling game with two periods and two players. Player 1 (the sender, s) has a
type space $\Theta$ and a message space $M$. Player 2 (the receiver, r) observes player 1’s
message m and selects an action from $A$. For simplicity assume that
all of these sets are finite. For the more complicated case of non-finite
sets, the following conditions can be modified but some technical issues
may be encountered. While player 1 knows her type, player 2 only
knows that 1’s type is drawn from some probability mass function $f(\cdot)$
on $\Theta$, and player payoffs are given by utility functions $u_s(m,a,\theta)$ and
$u_r(m,a,\theta)$. Accordingly, a mixed strategy profile is a message function
$\sigma_s(\theta)$ that selects a lottery on $M$ for every $\theta$ and an action function
$\sigma_r(m)$ that selects a lottery on $A$ for each possible message. By $\sigma_s(m,\theta)$ and
$\sigma_r(a,m)$, we denote the probability that m is played by a sender with
type $\theta$ and the probability that a is played by a receiver that has observed m,
respectively. An equilibrium (PBE or SE) also involves a belief $\mu(\theta \mid m)$.
Given a signaling game and a sequential equilibrium to the game, let
$U_s^*(\theta)$ denote the expected utility to player 1 of type $\theta$ from the equilibrium
profile. Finally let $\Delta$ denote the set of probability distributions on
$\Theta$ and let

$$BR_r(m) = \bigcup_{p(\cdot) \in \Delta} \Big\{ \arg\max_{a \in A} \sum_{\theta \in \Theta} u_r(m,a,\theta)\,p(\theta) \Big\}$$

denote the set of actions by r that maximize the receiver’s expected utility for
some beliefs about $\theta$. We say an action a is rationalizable if it is an
element of $BR_r(m)$.


DEFINITION 8.7. An SE $(\sigma_s(\cdot), \sigma_r(\cdot), \mu(\cdot \mid \cdot))$ satisfies the intuitive
criterion if for any message m such that $\sum_{\theta} \sigma_s(m,\theta) f(\theta) = 0$, the pos-
terior belief $\mu(\theta \mid m) > 0$ only if $U_s^*(\theta) < \max_{a \in BR_r(m)} u_s(m,a,\theta)$.

In words, an intuitive equilibrium requires that out-of-equilibrium
beliefs put zero probability on types that could not gain from the ob-
served deviation under some expectation that the receiver would re-
spond to the deviation by playing a strategy from her set of best re-
sponses.

To further demonstrate the concept, we modify the entry deterrence
game considered above. Now instead of restricting the message space
to be {WC, ~WC} we allow the incumbent to select a level $s \in \mathbb{R}_+$ at
a cost $cs$, where c takes one of two values depending on the incumbent’s type.
Let the value of office be 1 for the incumbent, so if in equilibrium she
accumulates $s'$ and wins with probability $\pi$ (because of randomness in
the challenger’s decision and the randomness associated with election
in a contested race) her payoff is $\pi - cs'$. As you will see, there are
pooling, partially pooling and separating equilibria to this game. It
can be shown, however, that there is exactly one intuitive equilibrium.
This equilibrium is a separating equilibrium. We leave the analysis
of this game as an exercise, and provide a solution in the back of the
book. Students are strongly encouraged to devote the time to work
through these two problems before looking at our solution.

The literature on refinements is quite large and refinements to the
intuitive criterion have appeared in applications. Commonly, models
with type spaces with more than two elements require stronger re-
finements. This is because several types might stand to gain from a
defection for different best responses by the receiver. In such situa-
tions, stronger refinements such as universal divinity (Banks and Sobel
1992) are used.⁶ Since universal divinity has been used in numerous political
applications, we present a definition and an example.


DEFINITION 8.8. An SE $(\sigma_s(\cdot), \sigma_r(\cdot), \mu(\cdot \mid \cdot))$ satisfies universal di-
vinity if for any message m such that $\sum_{\theta} \sigma_s(m,\theta) f(\theta) = 0$, the posterior
belief $\mu(\theta \mid m) > 0$ only if there exists an action $a \in BR_r(m)$ such that
$U_s^*(\theta) < u_s(m,a,\theta)$ and for every $\theta' \neq \theta$, $U_s^*(\theta') > u_s(m,a,\theta')$.


In comparing the two refinements, universal divinity is more strin-
gent in the set of types on which it allows the posteriors to place positive
probability. In the case of universal divinity, the posteriors can only
put weight on a type if there is a rationalizable action that makes the de-
viation desirable for this type and not desirable for any other type. In-
formally, universal divinity requires that off-the-path beliefs put weight
only on the types “most likely” to deviate.
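
The difference between the two refinements is easy to see when each is written as a filter on types. The following sketch implements the two conditions as stated in Definitions 8.7 and 8.8; the function names, the three actions, and all payoff numbers are hypothetical, and the receiver’s set of rationalizable actions is supplied by hand rather than computed.

```python
def intuitive_support(types, br_actions, eq_utility, dev_utility):
    """Types allowed positive posterior weight under Definition 8.7."""
    return [
        t for t in types
        if eq_utility[t] < max(dev_utility[a][t] for a in br_actions)
    ]


def universally_divine_support(types, br_actions, eq_utility, dev_utility):
    """Types allowed positive posterior weight under Definition 8.8."""
    allowed = []
    for t in types:
        for a in br_actions:
            gains = eq_utility[t] < dev_utility[a][t]
            others_lose = all(
                eq_utility[o] > dev_utility[a][o] for o in types if o != t
            )
            if gains and others_lose:
                allowed.append(t)
                break
    return allowed


# Hypothetical example: three sender types, three rationalizable responses.
types = (1, 2, 3)
br_actions = ("low", "mid", "high")
eq_utility = {1: 2, 2: 2, 3: 5}  # equilibrium payoffs by type
dev_utility = {                  # payoff from deviating, by response and type
    "low": {1: 1, 2: 1, 3: 1},
    "mid": {1: 1, 2: 3, 3: 4},
    "high": {1: 3, 2: 4, 3: 4},
}

print(intuitive_support(types, br_actions, eq_utility, dev_utility))
print(universally_divine_support(types, br_actions, eq_utility, dev_utility))
```

In this made-up example the intuitive criterion still allows weight on types 1 and 2, because each gains under some response. Universal divinity keeps only type 2, the one type for which some response makes the deviation attractive to it and unattractive to everyone else.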


⁶We encourage students to seek out the original Cho and Kreps and Banks and
Sobel pieces. In addition, Banks (1991) is an exemplary presentation of signaling
games and refinements in political science.
To contrast the intuitive criterion and universal divinity, we con-
sider a version of Michael Spence’s model of job signaling. We consider
an application to the question of political reform as a signal to gain
foreign investments or loan guarantees. Suppose a developing coun-
try has type $\theta \in \{1,2,3\}$ with $\theta$ measuring the nation’s potential to
successfully repay debts (higher numbers are better). Let $\pi_1$ and $\pi_2$
denote the probability of types 1 and 2 (with type 3 occurring with
probability $1-\pi_1-\pi_2$). The country must select a level of observable
political reform $r \in \mathbb{R}_+$. The pain associated with reform is dependent
on the nation’s type. After observing r the IMF determines a financial
package $f \in \mathbb{R}_+$ for the nation. We assume that the receiver’s goal is
to match f with the nation’s $\theta r$. Payoffs are as follows: given type $\theta$,
reform r and package f, the nation’s utility is $f - \frac{r}{\theta}$.

We first consider the case of $\pi_1+\pi_2 = 1$ (so there are just two
types of developing countries). In this case, there are pooling, partially 
pooling and separating sequential equilibria. However, the intuitive 
criterion selects a unique equilibrium. We sketch out the argument 
here. Consider Figure 8.12 which depicts indifference contours over 
pairs of messages and responses for senders of types 1 and 2. Since 
all senders prefer more funds and fewer reforms, movements to the 
northwest quadrant are desirable from their perspective. 


Insert Figure 8.12 


Consider a pooling (or partially pooling) equilibrium in which both
sender types are selecting the same level $r^p$ with positive probability.
After observing $r^p$ the receiver knows that the posterior probability
of $\theta < 2$ is greater than 0 and thus in any sequential equilibrium the
package that corresponds with $r^p$, $f(r^p)$, must be less than $2r^p$. We
show that there exists a message $r' > r^p$ such that if $f(r') = 2r'$
type 2 nations would prefer the deviation and type 1 nations would
not. In this case the intuitive criterion implies that beliefs must place
probability 1 on $\theta = 2$ if $r'$ is chosen. Given these beliefs following $r'$
the package $f(r') = 2r'$ is the unique sequentially rational package for
the receiver. In the notation of our definitions above we have

$$u_s(r', 2r', 1) = 2r' - r' = r'$$
$$u_s(r', 2r', 2) = 2r' - \frac{r'}{2}$$
$$U_s^*(1) = f(r^p) - r^p$$
$$U_s^*(2) = f(r^p) - \frac{r^p}{2}$$

Accordingly, $U_s^*(1) > u_s(r',2r',1)$ requires only $f(r^p) - r^p > r'$ while
$U_s^*(2) < u_s(r',2r',2)$ requires only $f(r^p) - \frac{r^p}{2} < 2r' - \frac{r'}{2} = f(r') - \frac{r'}{2}$.
Both of these inequalities can be simultaneously satisfied. See
Figure 8.12 for the region of such values of $r'$. To recap, in a pooling or
partially pooling equilibrium in which both types play $r^p$ with positive
probability, the intuitive criterion requires that following the (possibly
off-the-path) message of $r'$ beliefs assign probability 1 to type 2. Thus if
$r'$ is played the financial package will be $2r'$. The value $r'$ was chosen
so that type 2’s strictly prefer message $r'$ to $r^p$, meaning that type
2’s cannot put positive probability on the message $r^p$, contradicting
the assumption that there is an intuitive equilibrium in which both
types play $r^p$ with positive probability. Having ruled out all but
separating equilibria, we claim that the intuitive criterion selects a
unique separating equilibrium. We leave this as an exercise below.


Insert Figure 8.13 


We now assume that $\theta = 3$ occurs with probability $1-\pi_1-\pi_2 > 0$.
The first question to address is whether the intuitive criterion still
eliminates all pooling or partially pooling equilibria. The answer is
no. To see this suppose that types 1 and 2 are both playing a message
$r^p$ with positive probability, and $\theta = 3$ plays a pure strategy with a
reform level strictly above $r^p$.
It can be shown that this happens in some sequential equilibrium. Our
argument before was that the intuitive criterion required that following
some reform $r'$ that is higher than $r^p$ posterior beliefs are concentrated
at the higher type. However, with the third type present this turns
out not to be the case. We can now satisfy the intuitive criterion with
posteriors putting weight on $\theta = 3$ following a reform $r' > r^p$. With such
beliefs, sequentially rational choices of f might lead $\theta = 1$ to prefer the
deviation to the equilibrium payoff. More specifically, the requirement
that $f(r') < 2r'$ need no longer hold. Now it is just the case that
$f(r') < 3r'$ needs to hold. With $f(r')$ this big, type $\theta = 1$ might be
willing to deviate with the expectation that a deviation will result in
$f(r')$. Accordingly, in order for type $\theta = 2$ to signal that it is not type
1, it needs to send a message at least as high as $r^{\min}$ as depicted in
Figure 8.13. However, for every level of $r > r^{\min}$, there are possible
best responses $f(r)$ that make type 2 worse off than in equilibrium.
Notice that there is space between type 2’s indifference curve and the
line $f = 2r$. Since the intuitive criterion only requires that type 2
expect a response of $f > 2r$ for an r greater than $r^{\min}$, it cannot be
sure that the deviation is desirable.

However, if we test whether this partial pooling can happen in a
universally divine equilibrium the answer is different. Again suppose that
types 1 and 2 are both playing a message $r^p$ with positive probability,
and $\theta = 3$ plays a pure strategy with a reform level strictly above $r^p$.
Under universal divinity, a
message of $r'$ that is slightly larger than $r^p$ must result in a posterior
concentrated at $\theta = 2$. To see this, note that for values of r to the
right of $r^p$ type 1’s indifference curve is above type 2’s. This means
that for pairs $(r', f(r'))$ that lie between the two indifference curves, we
have $U_s^*(2) < u_s(r', f(r'), 2)$ and $U_s^*(1) > u_s(r', f(r'), 1)$. Moreover,
since type 3 is getting $f = 3r$, his utility is higher in equilibrium than
under the deviation. Accordingly, universal divinity requires that a
message of $r'$ result in beliefs concentrated at $\theta = 2$ and thus sequential
rationality requires that $f(r') = 2r'$, and so type 2 would gain from the
deviation. It is left as an exercise to show that there is exactly one
universally divine equilibrium in the game with 3 types.


6. Exercises 


EXERCISE 8.1. Consider the game of Figure 8.7, with the payoff to
the path B, NR being (5,0) instead of (4,0). Characterize all of the
PBE (mixed and pure strategy) of the game.


EXERCISE 8.2. Consider the game of Figure 8.7, with the payoff to
the path ND being (w,5) instead of (0,5). Here w is an exogenous
parameter known to the agents that ranges over $[-2,5]$. For what
regions of this range are there PBE in which ND occurs with positive
probability? In other words, for what subset of $[-2,5]$ are there PBE in
which ND is played?


EXERCISE 8.3. In the game depicted in Figure 8.8, show that there 


_ Jj bifO=a 
is not a PBE with m(0) = { Ne ee 

EXERCISE 8.4. Find all of the PBE of the game depicted in Figure 
8.14. 


Insert Figure 8.14 here


EXERCISE 8.5. A Model of Political Repression


Suppose that in each of two periods, society (S) must decide whether
to protest the policies of the state. The state may either acquiesce or
repress. Society gets 1 if the state acquiesces, $-1$ if the state represses,
and 0 if it does not protest.

Suppose there are two types of states: Moderate and Hardline. The
moderate state (M) gets 0 if the protest does not take place, $-2$ if it
acquiesces, and $-3$ if it represses. The hardline state (H) gets 0 for no
protest, $-2$ for repression, and $-3$ for acquiescing. Let $p_0$ be the prior
probability that the state is M.
a. In the second period, what is the critical value $p^*$ such that
S protests if $p_1 > p^*$ (where $p_1$ is S’s updated belief that the
state is M)?
b. Is there a separating equilibrium with these strategies?
M : {acquiesce, acquiesce}
H : {repress, repress}
S : {protest, stay home if repressed in period 1}
If so, for what values of $p_0$ does it hold?
c. Is there a pooling equilibrium in the first period with these
strategies?
M : {repress, acquiesce}
H : {repress, repress}
S : {protest, stay home if repressed in period 1}
If so, for what values of $p_0$ does it hold? Is it consistent with the
intuitive criterion?
d. Is there a pooling equilibrium in the first period with these
strategies?
M : {repress, acquiesce}
H : {repress, repress}
S : {stay home, stay home if repressed in period 1}
If so, for what values of $p_0$ does it hold? Is it consistent with the
intuitive criterion?
e. Compute a semi-pooling equilibrium where M represses in the
first period with probability q. For what values of $p_0$ does S
protest?


EXERCISE 8.6. Show that there are no partial pooling equilibria in
the open rule version of the Gilligan-Krehbiel model.


EXERCISE 8.7. Compute the other partial pooling equilibria for the
Warchest Game.


EXERCISE 8.8. Consider the open rule version of the Gilligan-Krehbiel
model described above, but suppose that there are two committee
members with $c > 0$ that observe the state and make simultaneous
messages to the floor. Does a separating equilibrium exist? Now suppose
that there are three such committee members. Does a separating
equilibrium exist?


EXERCISE 8.9. Show that given a finite extensive form game, if
$(\sigma(\cdot), \mu(\cdot))$ constitutes a sequential equilibrium then
$\sigma(\cdot)$ is subgame perfect.


EXERCISE 8.10. Show that in a one sender, one receiver signaling
game, if the sender’s type space has two elements, the sets of PBE and SE
coincide.


EXERCISE 8.11. Prove Proposition 8.9. 


EXERCISE 8.12. Show that in the Hussein-Bush game the pooling
equilibrium with y on the path does not violate the intuitive criterion.


EXERCISE 8.13. Can you modify the probability of w in Figure 8.11 
to support the observed path of play (~y and a) as a PBE? 


EXERCISE 8.14. Characterize the levels of s; and entry lotteries 
that are supportable in a PBE to the modified entry game with message 
space Ri. ((note this question should be answered in the book)) 


EXERCISE 8.15. Characterize the unique intuitive equilibrium to the 
modified entry game with message spaceR}, ((note this question should 
be answered in the book)). 


EXERCISE 8.16. In the reform for loan guarantees game with 2 types,
show that the unique intuitive equilibrium involves
$r(1) = \arg\max\{r - r^2\} = \frac{1}{2}$ and
$r(2) = \arg\max\{r - \frac{1}{2}r^2\} = 1$.


EXERCISE 8.17. In the reform for loan guarantees game with 3 types,
show that the unique universally divine equilibrium involves
$r(1) = \arg\max\{r - r^2\} = \frac{1}{2}$,
$r(2) = \arg\max\{r - \frac{1}{2}r^2\} = 1$, and
$r(3) = \arg\max\{r - \frac{1}{3}r^2\} = \frac{3}{2}$.


CHAPTER 9 


Repeated Games 


Many important models in political game theory are based on situations
in which agents play the same game repeatedly over time.
In many of these cases, the authors are interested in how certain social
practices like conventions, norms, cooperation, and trust are sustained
when actors may appear to have short run incentives to deviate from
the expected behaviors. Another significant application of repeated
games is to understand how social dilemmas such as the Prisoner’s Dilemma
can be solved without recourse to centralized authority (Taylor 1976).

The most interesting conceptual issue in such games is the extent
to which repetition creates the opportunity to sustain more behavior as
Nash equilibria than is possible in single-shot games. As we will see, in
general, the set of Nash equilibria is much larger in repeated games than
in the corresponding static versions. This is because expectations about
the future can lead agents to play strategies that would not be optimal in
a static context.

To generate some intuition as to how expectations about the future
can influence behavior, consider the following abstract normal form
game.


Table 9.1
1\2 | L | M | R
T | 8,8 | 0,0 | 1,9
M | 0,0 | 5,5* | 0,0
B | 9,1 | 0,0 | 3,3*


Note that if this game is played only once there are only two pure
strategy Nash equilibria: (M, M) and (B, R). Even though the strategy
profile (T, L) provides the highest aggregate payoffs, it is not a Nash
equilibrium since player 1 would defect to B and player 2 would defect to
R. Now consider what happens if this game is played twice. Suppose
player 1 chooses the strategy, “play T in period 1 and play M in period
2 if player 2 plays L in period 1. Otherwise play B in period 2.” Furthermore,
suppose that player 2 chooses the strategy “play L in period 1
and play M in period 2 if player 1 plays T in period 1. Otherwise play
R in period 2.” Note that this pair of strategies is a Nash equilibrium
where the “good” outcome (T, L) is played in the first period. To see
this, note that if either player defects in the first period, her payoff is at
most 9 + 3 = 12, which is less than the equilibrium utility of 8 + 5 = 13.
In fact, these strategies not only constitute a Nash equilibrium but the
equilibrium is also subgame perfect. Since (B, R) is an equilibrium of the
one-shot game, it is an equilibrium of all of the subgames that follow
actions other than (T, L), and (M, M) is an equilibrium of the subgame
that follows (T, L). Note that (T, L) cannot be played in equilibrium in
the second period, since it is not a Nash equilibrium of the one-shot game.

By repeating the game the players can use a “norm” that if there
is cooperation in the first period, the good equilibrium will be played
in the second period. Otherwise, the bad equilibrium will be played.
A second important point is that repeating the game can only improve
the players’ average utility. Since playing (B, R) in both periods is also
a subgame perfect Nash equilibrium, repetition cannot lower average
utility below (3,3). As we will see, almost any set of payoffs can be
a Nash equilibrium if the game is long enough and the players care
enough about the future.
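
The two claims in this example can be checked mechanically. The following is a minimal sketch, assuming the stage game payoffs of Table 9.1 with column labels L, M, R; the function names `pure_nash` and `two_period_check` are illustrative and not part of the text.

```python
from itertools import product

# Stage game of Table 9.1: rows T, M, B (player 1); columns L, M, R (player 2).
ROWS = ["T", "M", "B"]
COLS = ["L", "M", "R"]
PAYOFFS = {
    ("T", "L"): (8, 8), ("T", "M"): (0, 0), ("T", "R"): (1, 9),
    ("M", "L"): (0, 0), ("M", "M"): (5, 5), ("M", "R"): (0, 0),
    ("B", "L"): (9, 1), ("B", "M"): (0, 0), ("B", "R"): (3, 3),
}


def pure_nash(payoffs):
    """Return every pure profile from which no player gains by deviating."""
    equilibria = []
    for r, c in product(ROWS, COLS):
        u1, u2 = payoffs[(r, c)]
        row_is_best = all(payoffs[(r2, c)][0] <= u1 for r2 in ROWS)
        col_is_best = all(payoffs[(r, c2)][1] <= u2 for c2 in COLS)
        if row_is_best and col_is_best:
            equilibria.append((r, c))
    return equilibria


def two_period_check():
    """Player 1's payoff on the path (T,L),(M,M) versus the best first-period
    deviation, which is followed by the punishment equilibrium (B,R)."""
    on_path = PAYOFFS[("T", "L")][0] + PAYOFFS[("M", "M")][0]
    best_deviation = max(PAYOFFS[(r, "L")][0] for r in ROWS if r != "T")
    return on_path, best_deviation + PAYOFFS[("B", "R")][0]


print(pure_nash(PAYOFFS))   # [('M', 'M'), ('B', 'R')]
print(two_period_check())   # (13, 12)
```

Because the game is symmetric, the same comparison of 13 against 12 applies to player 2.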


1. The Repeated Prisoner’s Dilemma 


It is often argued that trade policy among nations is an example of 
the Prisoner’s Dilemma played repeatedly over time. It is generally 
thought that the world economy does better when all nations agree to 
free trade, but that individual countries might do better by protecting 
their domestic economy. Given this tension, the question arises as to 
how to sustain free trade regimes. One answer is that free trade can 
be supported as an equilibrium in a repeated game where trade wars 
begin whenever a major country defects from the trade agreement. 
To illustrate this argument, consider the following representation of a 
trade policy dilemma between the US and the EU. 


Table 9.2: Free Trade Game
US\EU | Free Trade | Protect
Free Trade | 10, 10 | 1, 12
Protect | 12, 1 | 4, 4


Obviously, if the game is played just once, the unique Nash equilibrium
is the strategy profile (Protect, Protect). Suppose that it is
played twice (ignoring the discounting of future payoffs). Then the
strategy set for each player is (where the period 2 actions depend on
the period 1 action of the other player)

$FT_1FT_2FT_2$, $FT_1FT_2P_2$, $FT_1P_2FT_2$, $FT_1P_2P_2$,
$P_1FT_2FT_2$, $P_1FT_2P_2$, $P_1P_2FT_2$, $P_1P_2P_2$


where $FT_1FT_2P_2$ means “play FT in period 1 and play FT in period
2 if the other country plays FT in period 1; otherwise play P.”


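Table 9.3: Two-Period Free Trade Game

US\EU | FT1FT2FT2 | FT1FT2P2 | FT1P2FT2 | FT1P2P2 | P1FT2FT2 | P1FT2P2 | P1P2FT2 | P1P2P2
FT1FT2FT2 | 20,20 | 20,20 | 11,22 | 11,22 | 11,22 | 11,22 | 2,24 | 2,24
FT1FT2P2 | 20,20 | 20,20 | 11,22 | 11,22 | 13,13 | 13,13 | 5,16 | 5,16
FT1P2FT2 | 22,11 | 22,11 | 14,14 | 14,14 | 11,22 | 11,22 | 2,24 | 2,24
FT1P2P2 | 22,11 | 22,11 | 14,14 | 14,14 | 13,13 | 13,13 | 5,16 | 5,16
P1FT2FT2 | 22,11 | 13,13 | 22,11 | 13,13 | 14,14 | 5,16 | 14,14 | 5,16
P1FT2P2 | 22,11 | 13,13 | 22,11 | 13,13 | 16,5 | 8,8 | 16,5 | 8,8
P1P2FT2 | 24,2 | 16,5 | 24,2 | 16,5 | 14,14 | 5,16 | 14,14 | 5,16
P1P2P2 | 24,2 | 16,5 | 24,2 | 16,5 | 16,5 | 8,8 | 16,5 | 8,8*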


Unlike in our first example, repeating the game once does not
affect behavior, as $(P_1P_2P_2, P_1P_2P_2)$ is the only Nash equilibrium. This
result can be generalized to any finite number of periods. In the
last period, each country will want to protect. Since this is known in
the penultimate period, each country will have an incentive to protect
in this period as well. This process unravels until each country is
protecting in every period.
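
The two-period normal form can be generated directly from the stage game rather than filled in by hand. The sketch below assumes the payoffs of Table 9.2 and encodes each strategy as a triple (period 1 action, period 2 action if the other country played FT, period 2 action if the other country played P); the names `total_payoffs` and `pure_nash` are illustrative.

```python
from itertools import product

# (US action, EU action) -> (US payoff, EU payoff), from Table 9.2.
STAGE = {
    ("FT", "FT"): (10, 10), ("FT", "P"): (1, 12),
    ("P", "FT"): (12, 1), ("P", "P"): (4, 4),
}
ACTIONS = ("FT", "P")
# Eight strategies per country, in the same order as the rows of Table 9.3.
STRATEGIES = list(product(ACTIONS, repeat=3))


def total_payoffs(us, eu):
    """Undiscounted two-period payoffs (US, EU) for a pair of strategies."""
    first = STAGE[(us[0], eu[0])]
    us_second = us[1] if eu[0] == "FT" else us[2]
    eu_second = eu[1] if us[0] == "FT" else eu[2]
    second = STAGE[(us_second, eu_second)]
    return first[0] + second[0], first[1] + second[1]


def pure_nash():
    """All pure strategy Nash equilibria of the two-period normal form."""
    found = []
    for us, eu in product(STRATEGIES, repeat=2):
        u_us, u_eu = total_payoffs(us, eu)
        us_is_best = all(total_payoffs(d, eu)[0] <= u_us for d in STRATEGIES)
        eu_is_best = all(total_payoffs(us, d)[1] <= u_eu for d in STRATEGIES)
        if us_is_best and eu_is_best:
            found.append((us, eu))
    return found


print(pure_nash())  # [(('P', 'P', 'P'), ('P', 'P', 'P'))]
```

The enumeration confirms that protecting in both periods, regardless of history, is the only pure strategy equilibrium of the twice-played game.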

The reason that we were able to induce some cooperation in our first
example was that the first period behavior helped coordinate between
multiple equilibria in the second period. The good equilibrium was
used as a reward while the bad equilibrium was used as a punishment.
However, since the Prisoner’s Dilemma has but one Nash equilibrium,
it is impossible to encourage cooperation with the promise of coordinating
on a good equilibrium in the future.

However, if the game lasts an infinite number of periods, this ceases
to be an issue. Now suppose that the “good equilibrium” is to play free
trade in every period while the “bad equilibrium” is to protect in every
period. Since there is no last period, the good equilibrium does not
unravel as it did in the finite case. Thus, in every single period,
cooperation is sustained by the reward of the good equilibrium and the
sanction of the bad.


2. The Grim Trigger Equilibrium


To see how infinite repetition eliminates the “last period” problem,
consider the following strategy in the infinite period trade game: “Play
free trade in every period until the other country protects, then protect
forever.” This is known as the grim trigger strategy, because any
failure to cooperate leads to the non-cooperative equilibrium in all future
periods. If each country plays this strategy, both receive 10 in
every period. Assuming that both countries discount the future at a
common rate $\delta$, the long-term utility of this strategy is
$\frac{10}{1-\delta}$.¹ To show
that this strategy is a Nash equilibrium, we must show that neither
player is willing to defect. It is necessary and sufficient to show that
no player is willing to defect for one period. Because each stage
game is identical, either a player will want to defect in every period
or in no period. Defection from this strategy gives the defector 12 in
the period of the defection. Since the other player will protect forever
following the defection, the defector’s best response is to protect forever
following the defection. Therefore, the defector gets 4 in every
period following the defection. The utility from defecting is therefore
$12 + \frac{4\delta}{1-\delta}$. Thus, the grim trigger strategies are a Nash
equilibrium of the repeated prisoner’s dilemma if and only if
$\frac{10}{1-\delta} \geq 12 + \frac{4\delta}{1-\delta}$ or
$\delta \geq \frac{1}{4}$. So as long as the players are sufficiently patient
($\delta$ large), the grim trigger strategy is a Nash equilibrium. We can also
show that the grim trigger equilibrium is subgame perfect. A proper subgame of
this game is also an infinitely repeated prisoner’s dilemma. In subgames
where no one has yet defected, the grim trigger strategies are a Nash
equilibrium by the argument above. In subgames that follow a defection,
both countries protect forever, which is a Nash equilibrium because
Protect is a best response to Protect in every period.
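
As a quick numerical check of the threshold, the sketch below compares the discounted value of perpetual free trade with the value of a one-period defection followed by permanent protection, using the payoffs of the trade game (10, 12, and 4). The helper names are illustrative, and exact fractions are used so that the boundary case $\delta = \frac{1}{4}$ is evaluated without rounding error.

```python
from fractions import Fraction


def discounted_stream(first, later, delta):
    """Value of receiving `first` today and `later` in every later period."""
    return first + delta * later / (1 - delta)


def grim_trigger_is_nash(delta, reward=10, temptation=12, punishment=4):
    """True if cooperating forever is at least as good as defecting once
    and then being held to the punishment payoff forever."""
    cooperate = discounted_stream(reward, reward, delta)
    defect = discounted_stream(temptation, punishment, delta)
    return cooperate >= defect


print(grim_trigger_is_nash(Fraction(1, 4)))  # True (both streams equal 40/3)
print(grim_trigger_is_nash(Fraction(1, 5)))  # False
```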


Now consider a generalized Prisoner’s Dilemma


Table 9.4: Generalized Prisoner’s Dilemma
1\2 | Cooperate | Don’t cooperate
Cooperate | a, a | d, c
Don’t cooperate | c, d | b, b


where $c > a > b > d$. Using exactly the same steps as above, the grim
trigger strategy is a SPNE if and only if
$\frac{a}{1-\delta} \geq c + \frac{\delta b}{1-\delta}$ or

$$\delta \geq \frac{c-a}{c-b}.$$

1 See chapter 3 for a discussion of time discounting and the calculation of
infinite sums of discounted utilities.

Thus, we find that cooperation is harder to sustain (requires a higher
discount factor) when:


(1) c is large relative to a and b.
(2) a and b are roughly equal.


3. Tit-for-Tat Strategies 


The grim trigger strategy is not the only SPNE of the infinitely
repeated prisoner’s dilemma which sustains the cooperative outcome.
Many authors find the grim trigger equilibrium unrealistic because it
predicts that cooperation disappears forever following a single defection.
Therefore, it is not very robust to mistakes by the players. It
assumes that the players cannot renegotiate a return to the cooperative
phase, which they would clearly have an incentive to do. However,
if we assume that players can engage in such renegotiation, cooperation
vanishes because the uncooperative equilibrium is no longer a
deterrent.

An alternative SPNE is based on “tit for tat” strategies of the
following form: “cooperate until your opponent cheats. Then cheat
until your opponent cooperates, then cooperate.” Note that there are
two possible subgames:


(1) A sub-game where both players are expected to cooperate in 
the next iteration. We call this the cooperation phase. 

(2) A sub-game where the defector is supposed to cooperate and 
the other player is supposed to punish the defector by not 
cooperating in the next iteration. We call this the punishment 
phase. 


Consider the first subgame. Again the equilibrium utility is
$\frac{a}{1-\delta}$, but the utility from a defection is a bit more
complicated. Defecting gives a one-period utility of c. In the next period,
the defector gets d from cooperating and being punished by the other player.
In the third period, the game returns to cooperation. Note that since we are
intent upon establishing the existence of a Nash equilibrium, we need only
check for one time deviations. Therefore, we assume that cooperation
lasts forever following the punishment phase. Thus, the total utility
is $c + \delta d + \delta^2 a + \delta^3 a + \ldots = c + \delta d + \frac{\delta^2 a}{1-\delta}$.
Thus, a player will not defect during this type of subgame if
$\frac{a}{1-\delta} \geq c + \delta d + \frac{\delta^2 a}{1-\delta}$ or

$$\delta \geq \frac{c-a}{a-d}.$$

Now consider the second subgame. The equilibrium utility for the
previous defector is $d + \frac{\delta a}{1-\delta}$, which comes from
cooperating in every period while the other player punishes him in the first
period. If a player defects in this subgame, he gets b in the current period,
d in the next period, and a after that. Thus, a player will not defect if
$d + \frac{\delta a}{1-\delta} \geq b + \delta d + \frac{\delta^2 a}{1-\delta}$ or

$$\delta \geq \frac{b-d}{a-d}.$$

Given these two results, we have established that if
$\delta \geq \max\left\{\frac{c-a}{a-d}, \frac{b-d}{a-d}\right\}$ the “tit
for tat” strategies constitute a SPNE. Intuitively, we might expect
tit-for-tat strategies to sustain more cooperation than the grim trigger if
the payoff of d were a sufficiently large deterrent against defecting in the
cooperation phase. However, this effect is counteracted by the fact that
if d is sufficiently bad, a defector has a stronger incentive to defect from
the punishment phase. In fact we can show that
$\max\left\{\frac{c-a}{a-d}, \frac{b-d}{a-d}\right\} \geq \frac{c-a}{c-b}$,
so that cooperation is always easier to sustain under the grim trigger
strategy than under tit-for-tat.² However, note the following:


(1) The grim trigger strategy is not optimal when players may 
make mistakes. Tit-for-tat may be better because the effect 
of mistakes is not permanent. 

(2) The grim trigger strategy is not renegotiation proof. Both 
players could do better by renegotiating to leave the punish- 
ment phase and return to the original equilibrium. However, 
if both players foresee this possibility the punishment phase 
will not be an effective deterrent. Tit-for-tat is renegotiation- 
proof. The punisher gets higher utility in the punishment 
phase and will not wish to renegotiate. 
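
The comparison between the two critical discount factors can be illustrated numerically. The sketch below assumes the generalized payoffs $c > a > b > d$ and uses the trade game values ($a = 10$, $b = 4$, $c = 12$, $d = 1$) as an example; the function names are illustrative, and the grid search is a sanity check rather than a proof.

```python
from fractions import Fraction


def grim_threshold(a, b, c, d):
    """Critical discount factor for the grim trigger strategy."""
    return Fraction(c - a, c - b)


def tit_for_tat_threshold(a, b, c, d):
    """Larger of the cooperation-phase and punishment-phase conditions."""
    return max(Fraction(c - a, a - d), Fraction(b - d, a - d))


def grim_never_harder(limit=10):
    """Check tit-for-tat >= grim trigger on a grid of integer payoffs
    satisfying c > a > b > d >= 0."""
    for d in range(limit):
        for b in range(d + 1, limit):
            for a in range(b + 1, limit):
                for c in range(a + 1, limit):
                    if tit_for_tat_threshold(a, b, c, d) < grim_threshold(a, b, c, d):
                        return False
    return True


print(grim_threshold(10, 4, 12, 1))         # 1/4
print(tit_for_tat_threshold(10, 4, 12, 1))  # 1/3
print(grim_never_harder())                  # True
```

In the trade game the binding tit-for-tat constraint is the punishment-phase condition $\frac{b-d}{a-d} = \frac{1}{3}$, not the cooperation-phase condition $\frac{c-a}{a-d} = \frac{2}{9}$.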


4. Intermediate Punishment Strategies


The grim trigger and tit-for-tat strategies represent just two of the 
possible supergame strategies that may sustain the cooperative out- 
come. We can generalize this class of strategies to include strategies 
that involve punishment phases of intermediate length. Consider the 
following strategies: 

(1) Cooperate until your opponent defects. Then do not cooper- 
ate for k periods. Return to cooperating after the punishment 
phase ends. 

(2) Cooperate until your opponent defects. Then do not cooper- 
ate for k periods if your opponent cooperates. If your oppo- 
nent does not cooperate at any point during this punishment 
phase, begin a new punishment phase of k periods. 


2 See Axelrod (1984) for evidence that real-life players typically choose
strategies resembling tit-for-tat.

Strategy 1 is similar to the grim trigger strategy in that a punishment
consists of a reversion to the strategy pair (don’t cooperate, don’t cooperate).
However, now the punishment phase is finite. The second strategy is
similar to tit-for-tat in that any defector is punished by cooperating
while the other player does not.

We consider strategy 1 first. Clearly there is no incentive to defect
during a punishment phase since mutual non-cooperation is a Nash
equilibrium of the stage game. We need only consider defections from a
cooperation phase. Now the payoff from a single defection during a cooperative
phase consists of the one period gain from defecting, the discounted
utility of b for k periods, and the utility of getting a in every period after
the end of the punishment phase.³ Thus, using the rules for sums
of discount factors that we discussed in the chapter on choice theory, we can
write this utility as
$c + \frac{\delta-\delta^{k+1}}{1-\delta} b + \frac{\delta^{k+1}}{1-\delta} a$.
Thus, sustaining cooperation requires that
$a(1-\delta^{k+1}) \geq (1-\delta)c + (\delta-\delta^{k+1})b$. While we cannot
get a closed form for the critical value of $\delta$, note that we can re-write
this expression as

$$\delta \geq \frac{c-a}{c-b} + \delta^{k+1}\,\frac{a-b}{c-b}.$$

Note that the first term on the right side of the inequality is the critical
value for the grim trigger strategy while the second term is positive for
any finite k. Thus, not surprisingly, it is harder to sustain cooperation
with a finite punishment phase. However, in a model where players
may make mistakes, this equilibrium may be preferred to the grim
trigger strategy.⁴

Now we consider strategy 2. First, let’s consider a defection from
the cooperation phase. The payoffs from a defection consist of a one
period benefit c, a punishment payoff of d for k periods, and a return
to the cooperative payoff a at the end of the punishment. Summing
all of these up generates
$c + \frac{\delta-\delta^{k+1}}{1-\delta} d + \frac{\delta^{k+1}}{1-\delta} a$.
Simple algebra reveals that this payoff is lower than the payoff from
defection in the tit-for-tat equilibrium by
$\frac{\delta^2-\delta^{k+1}}{1-\delta}(a-d)$. Thus, increasing the length of
the punishment phase decreases the incentive to defect from the cooperative
phase.

However, increasing k may not make such an equilibrium easier to
sustain as it reduces the incentive to comply in the punishment phase.
To see this, consider the payoffs from defecting from the punishment
phase. These payoffs consist of getting b for one period, d for k periods,
and then returning to a, or
$b + \frac{\delta-\delta^{k+1}}{1-\delta} d + \frac{\delta^{k+1}}{1-\delta} a$.
The payoffs from adhering to the equilibrium in the punishment phase depend on
which period of the punishment phase the game is in. Since we have
to verify compliance in each period, we need to ensure compliance in
the period where the payoff to compliance is lowest, the first period
of the punishment phase. The utility for complying with the
punishment in this period is
$\frac{1-\delta^{k}}{1-\delta} d + \frac{\delta^{k}}{1-\delta} a$. Thus,
compliance with the punishment requires

$$\delta \geq \left(\frac{b-d}{a-d}\right)^{1/k}.$$

This critical value is increasing in k, so longer punishments are harder to
enforce on the defector. A SPNE in these strategies requires that both
conditions on $\delta$ be satisfied.

3 Again the logic of Nash equilibrium suggests we can ignore the possibility of
future defections.
4 Like the grim trigger SPNE, this one is not renegotiation proof.
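
Since the cooperation-phase condition has no closed form, the critical discount factor can be found numerically. The sketch below is one way to do this under stated assumptions: it uses bisection on the inequality $a(1-\delta^{k+1}) \geq (1-\delta)c + (\delta-\delta^{k+1})x$, where the punishment payoff $x$ is $b$ for strategy 1 and $d$ for strategy 2, and it uses the closed form $\left(\frac{b-d}{a-d}\right)^{1/k}$ for compliance in the punishment phase of strategy 2. The function names and the existence test are illustrative choices, not taken from the text.

```python
def cooperation_threshold(a, c, x, k, tol=1e-12):
    """Smallest delta in (0, 1) satisfying
    a(1 - delta**(k+1)) >= (1 - delta)c + (delta - delta**(k+1))x,
    or None if no such delta exists."""
    def gain(delta):
        # Rearranged condition: gain(delta) >= 0 deters the deviation.
        return delta * (c - x) - (c - a) - delta ** (k + 1) * (a - x)

    # gain is concave with gain(1) = 0, so an interior root exists only
    # when its slope at delta = 1 is negative.
    if (k + 1) * (a - x) <= c - x:
        return None
    lo, hi = 0.0, 1.0 - 1e-9
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if gain(mid) >= 0:
            hi = mid
        else:
            lo = mid
    return hi


def compliance_threshold(a, b, d, k):
    """Critical delta for complying with a k-period punishment (strategy 2)."""
    return ((b - d) / (a - d)) ** (1 / k)


# Trade game payoffs: a = 10, b = 4, c = 12, d = 1.
for k in (1, 2, 5):
    print(k,
          cooperation_threshold(10, 12, 4, k),   # strategy 1
          cooperation_threshold(10, 12, 1, k),   # strategy 2, cooperation phase
          compliance_threshold(10, 4, 1, k))     # strategy 2, punishment phase
```

With $k = 1$ strategy 2 reduces to tit-for-tat, so its two thresholds should be close to $\frac{2}{9}$ and $\frac{1}{3}$; as k grows the cooperation-phase threshold falls while the compliance threshold rises.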


5. The Folk Theorem* 


A common theme of our examples is that so long as the agents
are sufficiently patient, outcomes that are not Nash equilibria in static
games can be supported as SPNE of infinitely repeated games. This
result generalizes significantly. In fact, any individually rational payoff 
to an infinitely repeated game can be sustained as a SPNE if agents 
are sufficiently patient. This important result has been well known for 
so long that no one knows who derived it first. It has therefore been 
afforded the status of a Folk theorem. In this section, we formally 
prove a version of this result. 

The primitives of a repeated game are a normal form stage game
$\Gamma = (N, S, u)$ and a vector of agent discount rates
$\delta = (\delta_1, \ldots, \delta_n)$. In each period
$t \in \{1, 2, 3, \ldots\}$, the agents play the normal form game $\Gamma$.
The game $\Gamma$ is often called the stage game to distinguish it from the
repeated game. Before agent $i$ selects $s_i^t \in S_i$, her strategy in
period $t$, she observes the strategy profile $s^{t-1}$ played in period
$t-1$. Moreover, we maintain the assumption of perfect recall, meaning that
$s_i^t$ can be conditioned on the history
$h^{t-1} = (s^1, \ldots, s^{t-1}) \in S^{t-1}$.
The null history is $h^0 = \emptyset$. A pure strategy for player $i$ is then a
sequence of mappings $\{s_i^t(h^{t-1}) : S^{t-1} \to S_i\}_{t=1}^{\infty}$.
A mixed strategy is a sequence of mappings
$\{\sigma_i^t(h^{t-1}) : S^{t-1} \to \Delta(S_i)\}_{t=1}^{\infty}$. Given a
sequence of lotteries over stage game profiles $\{\sigma^t\}_{t=1}^{\infty}$,
agent $i$’s expected utility is given by

$$EU_i(\{\sigma^t\}_{t=1}^{\infty}) = (1-\delta_i)\sum_{t=1}^{\infty} \delta_i^{t-1} E_{\sigma^t} u_i(s^t)$$

where $E_{\sigma^t} u_i(s^t)$ takes the expectation of $u_i(s^t)$ over the
mixture $\sigma^t$. The multiplier $(1-\delta_i)$ is included so that for a
constant sequence $\sigma^t = \sigma^*$,
$EU_i(\{\sigma^t\}_{t=1}^{\infty}) = u_i(\sigma^*)$. We denote the repeated
game induced by a stage game by $\Gamma^{\delta} = (N, S, u, \delta)$. Of
course a repeated game is also an extensive form game and our notions of NE
and SPNE are well defined in the repeated game.
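
The role of the $(1-\delta_i)$ multiplier is easy to see numerically. The sketch below approximates the normalized discounted utility with a long but finite list of stage payoffs, which is an assumption made only so the sum can be computed; the function name `normalized_utility` is illustrative.

```python
def normalized_utility(stage_payoffs, delta):
    """(1 - delta) * sum over t >= 1 of delta**(t-1) * u_t, where
    stage_payoffs[0] is the period 1 payoff. A long finite list is used
    as an approximation to the infinite sequence."""
    return (1 - delta) * sum(delta ** t * u for t, u in enumerate(stage_payoffs))


# A constant stream of 10 has normalized value (approximately) 10.
print(normalized_utility([10] * 2000, 0.9))

# In the trade game with delta = 1/4, defecting once (12) and then being
# punished forever (4) is worth about the same as cooperating forever.
print(normalized_utility([12] + [4] * 2000, 0.25))
```

The normalization keeps repeated game payoffs on the same scale as stage game payoffs, which is what makes the comparison with individually rational stage payoffs meaningful.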

We now focus on repeated games generated by finite normal form 
stage games. Given Nash’s theorem, we know that every such stage 
game has at least one mixed strategy NE. It is not surprising then that 
every such repeated game has as a mixed strategy SPNE the infinite 
repetition of the stage game mixed strategy NE. 


PROPOSITION 9.1. If $\sigma^*$ is a NE of the stage game then the
repeated game profile $\sigma_i^t(h^{t-1}) = \sigma_i^*$ for every $h^{t-1}$,
every $t$, and every $i$ is a SPNE of the repeated game.


An interesting feature of repeated games is that the set of SPNE is
usually very large. The class of results termed “Folk theorems” serves
to characterize the set of equilibrium payoffs that are supportable in an
equilibrium. We prove a particularly useful and simple Folk theorem.
We first need several definitions.


DEFINITION 9.1. The payoff vector $v \in \mathbb{R}^n$ is individually
rational if for each $i \in N$

$$v_i > \min_{s_{-i} \in S_{-i}} \left\{ \max_{s_i \in S_i} u_i(s_i, s_{-i}) \right\}.$$

The value $\min_{s_{-i} \in S_{-i}} \{\max_{s_i \in S_i} u_i(s_i, s_{-i})\}$ is
the minimum stage game utility that player $i$ can attain when she plays a
best response. This value is identified by letting the players $-i$ select
$s_{-i}$ so as to minimize the utility to $i$ of playing a best response to
$s_{-i}$.


DEFINITION 9.2. The payoff vector $v \in \mathbb{R}^n$ is feasible if there is
some sequence of pure strategy stage game profiles $\{s^t\}_{t=1}^{\infty}$
such that for each $i \in N$, $EU_i(\{s^t\}_{t=1}^{\infty}) = v_i$.


PROPOSITION 9.2. For every feasible and individually rational payoff
vector $v \in \mathbb{R}^n$ there is an n-tuple of discount rates $\delta$ s.t.
the payoff vector $v$ occurs in a NE of the repeated game with the discount
rates $\delta$.


PROOF. Assume that $v$ is feasible and individually rational. Let
$\{s^t\}$ be a strategy profile that calls for playing the sequence of stage
game profiles that attains the payoff vector $v$ as long as no one has
previously deviated from this strategy or more than one player has deviated,
and otherwise plays the strategy
$s_{-i} = \arg\min_{s_{-i} \in S_{-i}} \{\max_{s_i \in S_i} u_i(s_i, s_{-i})\}$,
which punishes the unique player $i$ that deviated, in all subsequent periods.
At any period $t$ the payoff to $i$ of playing $\{s^t\}$ is $v_i$ and the
payoff to deviating is bounded by

$$(1-\delta_i^t)\,v_i + \delta_i^t(1-\delta_i)\max_{s \in S} u_i(s) + \delta_i^{t+1}\min_{s_{-i} \in S_{-i}}\left\{\max_{s_i \in S_i} u_i(s_i, s_{-i})\right\}.$$

This value is less than $v_i$ if

$$\delta_i > \frac{\max_{s \in S} u_i(s) - v_i}{\max_{s \in S} u_i(s) - \min_{s_{-i} \in S_{-i}}\{\max_{s_i \in S_i} u_i(s_i, s_{-i})\}}.$$

Since $\max_{s \in S} u_i(s) > v_i > \min_{s_{-i} \in S_{-i}}\{\max_{s_i \in S_i} u_i(s_i, s_{-i})\}$
the right hand side is strictly less than 1. Thus as long as this condition is
satisfied for each $i \in N$ the conjectured strategy profile is a NE of the
repeated game. $\square$
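
The bound in the proof can be computed for the two-player games used earlier in the chapter. The sketch below assumes a bimatrix stage game stored as nested lists of payoff pairs and uses the pure strategy minmax value from Definition 9.1; the function names `minmax_value` and `folk_threshold` are illustrative.

```python
from fractions import Fraction


def minmax_value(payoffs, player):
    """Pure strategy minmax payoff of `player` (0 = row, 1 = column).
    payoffs[r][c] is the pair (row payoff, column payoff)."""
    n_rows, n_cols = len(payoffs), len(payoffs[0])
    if player == 0:
        return min(max(payoffs[r][c][0] for r in range(n_rows))
                   for c in range(n_cols))
    return min(max(payoffs[r][c][1] for c in range(n_cols))
               for r in range(n_rows))


def folk_threshold(payoffs, player, target):
    """Lower bound on the player's discount rate from the proof of
    Proposition 9.2, for a target per-period payoff `target`."""
    best = max(cell[player] for row in payoffs for cell in row)
    return Fraction(best - target, best - minmax_value(payoffs, player))


# Free trade game of Table 9.2 (rows and columns ordered FT, P).
TRADE = [[(10, 10), (1, 12)],
         [(12, 1), (4, 4)]]
print(minmax_value(TRADE, 0))        # 4
print(folk_threshold(TRADE, 0, 10))  # 1/4
```

For the trade game the bound coincides with the grim trigger threshold, because the minmax punishment and reversion to (Protect, Protect) give the deviator the same payoff.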


The equilibria used in the proof need not be SPNE as the punishment
might be very costly to impose. We can characterize a set of payoff
vectors supportable in SPNE of the repeated game using reversion to
stage game NE strategies as the punishment.


PROPOSITION 9.3. If $v \in \mathbb{R}^n$ is a feasible payoff vector for which
there is some mixed strategy stage game NE which yields the payoff
vector $v'$ s.t. $v'_i < v_i$ for every $i \in N$, then for discount rates
sufficiently close to 1 there is a SPNE of the repeated game which yields the
payoff vector $v$.


6. Application: Interethnic Cooperation 


Fearon and Laitin (1996) use infinitely repeated games to understand
how inter-ethnic cooperation might be sustained. Consider two
groups A and B, both with n (even) members. In each period t, players
are randomly matched to play the following Prisoner’s Dilemma.


Table 9.5: Inter-Ethnic Cooperation Game
1\2 | C | D
C | 1, 1 | $-\beta$, $\alpha$
D | $\alpha$, $-\beta$ | 0, 0


where $\alpha > 1$, $\beta > 0$, and $\frac{\alpha-\beta}{2} < 1$. Further
suppose that each of the members has a common discount factor
$\delta \in (0,1)$. In each period m members of each group are selected to be
paired with members of the other group while the remaining $n - m$ are matched
with members of their own group. This random matching process implies that each
player has a probability $p = \frac{m}{n}$ of being matched with an “out-group”
member.

To capture the dynamics of intergroup and intragroup interaction,
Fearon and Laitin assume that within groups the entire history of play
is observed by all members of the group. However, the history of play
for members of the other group is not observed. Thus, in their model
inter-ethnic cooperation is hard to sustain because those who defect in
inter-group interactions cannot be individually singled out for punishment
by members of the other group. Nevertheless, Fearon and Laitin
argue that cooperation can be sustained even in the absence of these
direct sanctions. They consider two such equilibria of this game. The
first is what they call the spiral equilibrium. In this equilibrium,
cooperation is supported within groups by finite-length punishments against
individual defectors. However, intergroup cooperation is sustained by
the threat of group-specific punishment phases of finite length. During
these punishment phases, all members of a given group are punished
by the other group if any has defected in an inter-group interaction.
The second equilibrium is the in-group policing equilibrium in which
there are no cross-group punishments but each group punishes its own
members for defections against the other group. Below we analyze the in-group
policing equilibrium and refer the reader to the original article for the
discussion of the spiral equilibrium.


6.1. The In-group Policing Equilibrium. The strategy for the
in-group policing equilibrium follows.


Play C in all out-group pairings. For in-group pairings,
always play C with any partner not in a punishment
phase, and D with any partner in a punishment phase. A player
enters or restarts a punishment phase of $k^{gp}$ periods by
defecting against an out-group member or against an in-group
member.


We will focus on group A as the proof extends identically to the
strategies of group B. Let $s_t = (k_1, k_2, \ldots, k_n)$ be the state of the
system where $k_i$ is the number of periods remaining in the punishment phase
for player $i$ at the beginning of period $t$. If $k_i = 0$, we say that player
$i$ is a cooperator, and player $i$ is a defector if $k_i > 0$. For a given
state $s_t$ and any integer $l > 0$, let $n_{t+l}$ be the number of members of
group A who will be cooperators in period $t+l$, assuming that each
plays the equilibrium strategy from period $t$ to period $t+l$. Therefore,
$q_{t+l} = \frac{n_{t+l}}{n}$ is the probability of facing a cooperator in an
in-group interaction.


To demonstrate that these strategies constitute a subgame perfect 
Nash equilibrium, we need to verify the following conditions. 


(1) A cooperator i has no incentive
(a) to defect against any out-group player
(b) to defect against an in-group cooperator
(c) to cooperate with any in-group defector
(2) A defector i has no incentive
(a) to defect against an out-group player
(b) to defect against an in-group cooperator
(c) to cooperate with an in-group defector


Clearly, conditions 1(c) and 2(c) will always be satisfied since those
deviations lower utility in the current period without affecting the
strategies of any other player (these deviations do not trigger punishments).
Further note that 1(a) and 1(b) reflect the same trade-offs since both
deviations generate a payoff of $\alpha$ in period t followed by $k^{gp}$
periods of punishment. Thus, we need only establish that there will be no
incentive to deviate in cases 1(a), 2(a), and 2(b).

We can write the utility for cooperation against an in-group cooperator
or out-group member as

$$1 + \sum_{l=1}^{\infty} \delta^l \left(p + (1-p)\left(q_{t+l} + (1-q_{t+l})\alpha\right)\right)$$

while the utility from deviations 1(a) and 1(b) is

$$\alpha + \sum_{l=1}^{k^{gp}} \delta^l \left(p + (1-p)\left(-q_{t+l}\beta + (1-q_{t+l})0\right)\right) + \sum_{l=k^{gp}+1}^{\infty} \delta^l \left(p + (1-p)\left(q_{t+l} + (1-q_{t+l})\alpha\right)\right). \tag{9.1}$$

The net utility of cooperating is therefore

$$1 - \alpha + \sum_{l=1}^{k^{gp}} \delta^l (1-p)\left(q_{t+l}(1+\beta) + (1-q_{t+l})\alpha\right). \tag{9.2}$$

To show that deviations 1(a) and 1(b) will not occur, we need equation
9.2 to be positive for all states and the resulting sequences of $q_{t+l}$. If
$\alpha > 1 + \beta$, equation 9.2 is minimized by $q_{t+l} = 1$ for
$l = 1, \ldots, k^{gp}$. This is the path following $s_t = (0, 0, \ldots, 0)$.
The net utility is positive following this state if and only if

$$\delta - \delta^{k^{gp}+1} \geq \frac{(1-\delta)(\alpha-1)}{(1-p)(1+\beta)}. \tag{9.3}$$

Now consider the case where $1 + \beta > \alpha$. Then the net utility would be
minimized at $q_{t+l} = 0$ for $l = 1, \ldots, k^{gp}$. However, given the
definition of $q_t$ this is an infeasible sequence since all players are
assumed to cooperate in their punishment phases and terminate their punishments
after $k^{gp}$ periods. Thus, the minimizing sequence is one where all players
defect in time $t-1$ and return to cooperation status in period
$t + k^{gp} - 1$. The sequence of q is therefore $q_{t+l} = 0$ for
$l = 1, \ldots, k^{gp}-1$ and $q_{t+k^{gp}} = 1$.
Thus, we now require (after some algebra) that

$$\delta^{k^{gp}} \geq \frac{\alpha-1}{(1-p)(1+\beta)} - \frac{\alpha\left(\delta-\delta^{k^{gp}}\right)}{(1-\delta)(1+\beta)}. \tag{9.4}$$

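Now we need to check whether a defector at time t will wish to
make deviation 2(a). Suppose that a defector with $k_i$ periods of punishment
remaining is paired against an out-group player. The utility of cooperating is

$$1 + \sum_{l=1}^{k_i-1} \delta^l \left(p + (1-p)\left(-q_{t+l}\beta + (1-q_{t+l})0\right)\right) + \sum_{l=k_i}^{\infty} \delta^l \left(p + (1-p)\left(q_{t+l} + (1-q_{t+l})\alpha\right)\right)$$

while the utility of the deviation is given by equation 9.1. Thus, the
defector will cooperate with an out-group member so long as

$$\sum_{l=k_i}^{k^{gp}} \delta^l (1-p)\left(q_{t+l}(1+\beta) + (1-q_{t+l})\alpha\right) \geq \alpha - 1.$$

The left side of this inequality is minimized when $k_i = k^{gp}$ so that we
require

$$\delta^{k^{gp}} (1-p)\left(q_{t+k^{gp}}(1+\beta) + (1-q_{t+k^{gp}})\alpha\right) \geq \alpha - 1.$$

Since all players play according to the equilibrium strategy,
$q_{t+k^{gp}} = 1$ for all $s_t$. Therefore, we require

$$\delta^{k^{gp}} \geq \frac{\alpha-1}{(1-p)(1+\beta)}. \tag{9.5}$$
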
Finally, we need to check deviation 2(b). So assume a defector with 
k, = 1 is paired against as cooperator. The utility of cooperating is 


+5) 5 (p+(1—p) (—q18-+(1—4441)0)+ 9 © 6'(p+(1—p) (q+ (1—ae41)@) 


Again the utility for defecting is given by

$$\sum_{l=1}^{k^{gp}} \delta^l\left(p + (1-p)\left(-q_{t+l}\beta + (1-q_{t+l})0\right)\right) + \sum_{l=k^{gp}+1}^{\infty} \delta^l\left(p + (1-p)\left(q_{t+l} + (1-q_{t+l})\alpha\right)\right)$$

so that the net utility of cooperating is
$\sum_{l=k_t}^{k^{gp}} \delta^l (1-p)\left(q_{t+l}(1+\beta) + (1-q_{t+l})\alpha\right) - \beta$.
Using the same argument as we did on 2(a), our SPNE requires

$$\delta^{k^{gp}} \geq \frac{\beta}{(1-p)(1+\beta)} \tag{9.6}$$

Now we have a full set of equilibrium conditions. First, consider
the case of $\alpha > 1+\beta$, where we require 9.3, 9.5, and 9.6. Note that
equation 9.6 holds whenever 9.5 does. Less obviously, we can show
that equation 9.5 implies equation 9.3. We can re-write 9.3 as


kop a-—l (a — 1) 
mo 5(6 a) a 


Note that if equation 9.5 holds, both sides of equation 9.7 are positive
and the right side must be larger since $1 > \delta > \delta^{k^{gp}}$. Thus, 9.5
is necessary and sufficient for the in-group policing strategies to be a
SPNE if $\alpha > 1+\beta$.

Now consider the case $\alpha < \beta + 1$ where we require equations 9.4,
9.5, and 9.6 to hold, but now 9.6 implies 9.5. Also, if equation 9.6
holds, note that


kgP B = 1 a— 1 (6 a oe") 
on = > — 
Upp) Capp). (pba ep) be ap) 
Therefore, equation 9.6 implies equation 9.4 so that 9.6 is necessary
and sufficient for the in-group punishments to constitute a SPNE. We
have established the following proposition.


PROPOSITION 9.4. The in-group punishment strategy with $k^{gp}$ period
punishments is a SPNE if and only if

$$\delta^{k^{gp}} \geq \max\left\{\frac{\alpha-1}{(1-p)(1+\beta)}, \frac{\beta}{(1-p)(1+\beta)}\right\}$$


Note some important features of the SPNE. First, if this condition
holds for $k^{gp} > 1$, it must hold for $k^{gp} = 1$. Thus, no more than a
single period of punishment is required to sustain the equilibrium. In
fact, longer punishments are counterproductive since they lower the
incentives of defectors to cooperate in order to end the punishments.
A second important point about the SPNE is that they can only be
sustained if $p$, the probability of out-group interactions, is low enough.
Since the SPNE requires
$\max\left\{\frac{\alpha-1}{(1-p)(1+\beta)}, \frac{\beta}{(1-p)(1+\beta)}\right\} < 1$,
it can never exist if
$p > \min\left\{1 - \frac{\alpha-1}{1+\beta}, 1 - \frac{\beta}{1+\beta}\right\}$.
When the probability of interaction
with the out-group is large, the probability of punishment for any
deviation is low since punishments are meted out only from in-group players.
This has the somewhat counterintuitive implication that inter-ethnic
cooperation is impeded by too much inter-ethnic interaction. Fearon
and Laitin argue that this result provides an endogenous rationale for
groups wanting to preserve ethnic boundaries.
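
As a quick numerical check of the condition in Proposition 9.4, the sketch
below evaluates the bound on $\delta^{k^{gp}}$ for given payoffs. It is only an
illustration: the function names are hypothetical, and it takes the binding
requirement to be the larger of the two thresholds in (9.5) and (9.6), which
is what the case analysis above implies.

```python
def policing_threshold(alpha: float, beta: float, p: float) -> float:
    """Lower bound on delta**k_gp implied by conditions (9.5) and (9.6).

    alpha: payoff from defecting against a cooperator (alpha > 1)
    beta:  loss from cooperating against a defector (beta > 0)
    p:     probability of an out-group pairing (0 <= p < 1)
    """
    return max(alpha - 1.0, beta) / ((1.0 - p) * (1.0 + beta))


def supports_policing(delta: float, k_gp: int, alpha: float, beta: float, p: float) -> bool:
    """True if k_gp-period in-group punishments satisfy the condition."""
    return delta ** k_gp >= policing_threshold(alpha, beta, p)


if __name__ == "__main__":
    # Illustrative parameter values only; they do not come from the text.
    alpha, beta, p, delta = 1.5, 1.0, 0.2, 0.9
    print("threshold:", policing_threshold(alpha, beta, p))
    for k in (1, 3, 5):
        print(k, supports_policing(delta, k, alpha, beta, p))
```

With these illustrative numbers a one-period punishment satisfies the
condition while a five-period punishment does not, in line with the remark
that longer punishments are harder to sustain.
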
7. Application: Trade Wars


Consider a generalization of the free trade game as presented in 
Table 9.6. 


Table 9.6: Generalized Free Trade Game 


We now interpret $\theta$ as the value to each country of the other
country’s open markets and $\rho$ as each country’s gain from protecting its own
markets. From before, we know that Free Trade can be supported by
the grim trigger strategies if and only if
$\theta + \rho + \frac{\delta\rho}{1-\delta} \leq \frac{\theta}{1-\delta}$ or

$$\delta \geq \frac{\rho}{\theta}$$


Supporting this equilibrium depends crucially on each country per- 
fectly observing the policies of the other countries. This may not 
be a very realistic assumption since countries may use invisible trade 
barriers. Also since trade flows will vary with a number of market 
conditions unrelated to trade policy, each country will not know for 
certain whether the fall in trade is due to malfeasance by the other 
side. 

To model these issues, we assume that each country cannot directly
observe the policies of the other country, but observes only the value
of its trade $\Theta_i$, which is a random variable. To keep things as simple as
possible, we let $\Theta_i = \theta > 0$ with probability $\pi$ when country $j$
engages in free trade and $\Theta_i = 0$ with probability $1 - \pi$. When country
$j$ protects its markets, $\Theta_i = 0$ with probability 1. Consequently,
country $i$ knows for sure that $j$ chose free trade if $\Theta_i = \theta$ but is
uncertain of $j$’s policies when $\Theta_i = 0$. We will assume that
$\pi\theta > \rho$ so that each country prefers the free trade outcome to mutual
protectionism in expectation. (This model is based loosely on Green and
Porter’s (1984) model of imperfect collusion and price wars in economic
cartels.)

Clearly, just as before, protection is a SPNE to this game, but we
would like to see if there are equilibria which will sustain some level of
free trade. Obviously, such an equilibrium will require some form of
punishment when $\Theta_i = 0$ is observed even though it cannot be verified
with certainty that country $j$ actually defected.

First, consider a grim trigger strategy in which country $i$ protects
forever whenever it observes $\Theta_i = 0$. Thus, the payoffs to free trade
in the first period are $\pi\theta + (1-\pi)0$. Free trade continues to the next
period so long as $\Theta_1 = \Theta_2 = \theta$, which occurs with probability
$\pi^2$. Thus, country $i$’s payoffs in the second period are
$\pi^2(\pi\theta + (1-\pi)0) + (1-\pi^2)\rho = \pi^3\theta + (1-\pi^2)\rho$.
Continuing the same logic to period 3, we get
$\pi^5\theta + (1-\pi^4)\rho$. Thus, the infinite discounted sum of utilities
from free trade is

$$V^{FT} = \pi\theta + \delta\left(\pi^3\theta + (1-\pi^2)\rho\right) + \delta^2\left(\pi^5\theta + (1-\pi^4)\rho\right) + \ldots$$

$$V^{FT} = \pi\theta\left(1 + \delta\pi^2 + \delta^2\pi^4 + \ldots\right) + \delta(1-\pi^2)\rho + \delta^2(1-\pi^4)\rho + \ldots$$

$$V^{FT} = \frac{\pi\theta - \delta\pi^2\rho}{1-\delta\pi^2} + \frac{\delta\rho}{1-\delta}$$

The utility from defecting to protection is more straightforward. The
one period payoff is $\pi\theta + \rho$ while the future payoff is
$\frac{\delta\rho}{1-\delta}$ so that $V^P = \pi\theta + \frac{\rho}{1-\delta}$.
Thus, country $i$ will choose free trade if and only if $V^{FT} \geq V^P$
or

$$\delta \geq \frac{\rho}{\pi^3\theta}$$

For comparison, note that if policies were observable, a SPNE in grim
trigger strategies would exist so long as $\delta \geq \frac{\rho}{\theta}$. Thus,
the grim trigger strategy is significantly more difficult to sustain when
policies are unobservable. In fact, if $\rho > \pi^3\theta$, grim trigger
strategies would not constitute a SPNE for any value of $\delta$. The grim
trigger strategy is also very costly in the sense that infinite punishments
can be generated by variation in $\Theta$, independent of policy.
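
The grim trigger comparison can be checked numerically. The sketch below is
an illustration under the assumptions stated in the text (trade value
$\theta$ with probability $\pi$, protection gain $\rho$, discount factor
$\delta$); the function names and parameter values are hypothetical. It
evaluates the closed form for $V^{FT}$, cross-checks it against a truncated
version of the infinite sum, and compares it with $V^P$.

```python
def v_free_trade(theta, rho, pi, delta):
    """Closed-form discounted payoff from free trade under the grim trigger."""
    return ((pi * theta - delta * pi**2 * rho) / (1 - delta * pi**2)
            + delta * rho / (1 - delta))


def v_free_trade_series(theta, rho, pi, delta, periods=2000):
    """Truncated version of the infinite sum, used only as a cross-check."""
    total = 0.0
    for t in range(periods):
        # Free trade survives to period t with probability pi**(2*t).
        survive = pi ** (2 * t)
        total += delta**t * (survive * pi * theta + (1 - survive) * rho)
    return total


def v_protect(theta, rho, pi, delta):
    """Payoff from protecting today and living with protection forever after."""
    return pi * theta + rho / (1 - delta)


def grim_trigger_threshold(theta, rho, pi):
    """Smallest discount factor at which free trade is preferred."""
    return rho / (pi**3 * theta)


if __name__ == "__main__":
    theta, rho, pi = 1.0, 0.2, 0.9  # illustrative values only
    print("threshold:", grim_trigger_threshold(theta, rho, pi))
    for delta in (0.2, 0.5):
        print(delta, v_free_trade(theta, rho, pi, delta), v_protect(theta, rho, pi, delta))
```

The two payoffs coincide exactly at the threshold, so free trade is
preferred only for discount factors above it.
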

So now we will follow Green and Porter (1984) and consider finite
trigger strategies. Now if either country observes $\Theta_i = 0$, a trade war
begins in which both countries protect their markets for $k \geq 1$ periods.
We now check conditions under which free trade is the optimal policy
if there is no trade war going on. (Since mutual protection is a Nash
equilibrium, we do not need to check the optimality of protecting during
a trade war.) Let $V_k^{W}$ be the value of the payoffs
for country $i$ for a $k$ period trade war. It is easy to see that

$$V_k^{W} = \frac{(1-\delta^k)\rho}{1-\delta}$$

Let $V^{FT}$ be the payoff beginning a period in which the countries are in
a free trade phase. Therefore,

$$V^{FT} = \left(\pi\theta + \pi^2\delta V^{FT}\right) + (1-\pi^2)\delta\left(V_k^{W} + \delta^k V^{FT}\right)$$

Thus, the equilibrium payoff to free trading is

$$V^{FT} = \frac{\pi\theta + (1-\pi^2)\delta V_k^{W}}{1 - \pi^2\delta - (1-\pi^2)\delta^{k+1}}$$

We can compute the value of a deviation as $V^P$:

$$V^P = \pi\theta + \rho + \delta\left(V_k^{W} + \delta^k V^{FT}\right)$$
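An equilibrium requires that $V^{FT} \geq V^P$ or

$$\frac{\pi^2\delta(1-\delta^k)(\pi\theta - \rho)}{1 - \pi^2\delta - (1-\pi^2)\delta^{k+1}} \geq \rho \tag{9.8}$$

The left side of this expression is increasing in $k$ so let $k^{min}(\delta)$ be
the smallest integer such that the inequality holds for a given $\delta$. Thus,
we have established the following proposition.


PROPOSITION 9.5. If $\delta > \frac{\rho}{\pi^3\theta}$ and $k \geq k^{min}(\delta)$,
the following strategies are a SPNE.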


(1) Begin the game free trading.
(2) Free trade until $\Theta_i = 0$ for either country.
(3) Following a period in which $\Theta_i = 0$, protect for $k$ periods.
(4) After $k$ periods, return to free trade.


Note that a SPNE can be supported with trade wars of any length
greater than $k^{min}(\delta)$. However, if we assume that the countries can
coordinate on the optimal duration of trade wars, the model provides
a theory of their duration. Intuitively, since trade wars are costly,
the countries should coordinate on the minimal length war sustaining
cooperation, $k^{min}(\delta)$. This intuition can be verified by checking that
$V^{FT}$ is strictly decreasing in $k$ so long as $\pi\theta > \rho$. Thus, we can
derive empirical predictions by examining equation 9.8. Recall that the left
side is increasing in $k$. This implies that $k^{min}(\delta)$ is increasing in
the value of protectionism and decreasing in the value of free trade. Clearly
this makes sense. Trade wars should be longer when the incentive problems
are more severe. We can also see that the duration of trade conflict is
decreasing in $\pi$. This is a very counter-intuitive result. Suppose that we
interpreted $1 - \pi$ as the volatility of trade flows (the probability of low
trade during a free trade regime). This interpretation suggests that
trade volatility increases the duration of trade wars. This is necessary
to keep countries from enacting barriers and blaming the results on
natural volatility. However, in equilibrium, the countries never protect
outside trade wars so that they know with certainty that $\Theta_i = 0$ was
caused by natural volatility. Yet they must engage in costly, lengthy
trade wars to ensure that barriers are not erected.
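
The minimal trade war length can be found by direct search. The sketch below
is an illustration, not the authors’ procedure: it computes the trade war
value, the free trade value, and the deviation value from the expressions
above and returns the first $k$ at which free trade is weakly preferred. The
function names, the search cap `k_max`, and the parameter values are
assumptions made for the example.

```python
def war_value(rho, delta, k):
    """Discounted payoff from k periods of mutual protection."""
    return (1 - delta**k) * rho / (1 - delta)


def free_trade_value(theta, rho, pi, delta, k):
    """Value of starting a period in the free trade phase with k-period wars."""
    v_war = war_value(rho, delta, k)
    denom = 1 - pi**2 * delta - (1 - pi**2) * delta ** (k + 1)
    return (pi * theta + (1 - pi**2) * delta * v_war) / denom


def deviation_value(theta, rho, pi, delta, k):
    """Value of protecting today, which triggers a k-period war for sure."""
    v_ft = free_trade_value(theta, rho, pi, delta, k)
    return pi * theta + rho + delta * (war_value(rho, delta, k) + delta**k * v_ft)


def k_min(theta, rho, pi, delta, k_max=10_000):
    """Smallest war length sustaining free trade, or None if there is none."""
    if delta <= rho / (pi**3 * theta):
        return None  # even an infinite punishment is not enough
    for k in range(1, k_max + 1):
        if free_trade_value(theta, rho, pi, delta, k) >= deviation_value(theta, rho, pi, delta, k):
            return k
    return None


if __name__ == "__main__":
    # Illustrative values only.
    print(k_min(theta=1.0, rho=0.2, pi=0.9, delta=0.5))
    print(k_min(theta=1.0, rho=0.2, pi=0.9, delta=0.3))
    print(k_min(theta=1.0, rho=0.2, pi=0.85, delta=0.3))
```

In this example less patient countries need a longer war, and once $\pi$
falls far enough no finite war sustains free trade at all.
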


8. Exercises 


EXERCISE 9.1. Assume that there are three groups with the following
preferences over three policies.

We will be analyzing this as a repeated game where in each period
groups may make a counter proposal to the status quo. For example,
suppose that the status quo is $x$; then both $B$ and $C$ wish to propose $z$,
which passes and becomes the new status quo. In the next period $A$ and
$C$ wish to propose $y$ which passes. Assume that each group discounts
the future by $\delta$.

a. What is each group’s utility from the infinite cycle (starting
at $x$) of policies that results?

b. Now suppose that $A$ and $B$ decide to form a political party to
implement policy $x$ forever. Find a critical value $\delta^*$ such that
if $\delta > \delta^*$ there is a subgame perfect Nash equilibrium where
$A$ and $B$ vote for $x$ in every period, $C$ proposes $z$, and if $A$ or
$B$ ever defects a policy cycle starts.


EXERCISE 9.2. Prove Proposition 1. 
EXERCISE 9.3. Prove Proposition 8. 


EXERCISE 9.4. Find conditions for the existence of the Spiral SPNE
of Fearon and Laitin’s model. This equilibrium is based on the
following strategies:


In in-group pairings, always play C with a cooperator, and
always play D against a defector, regardless of one’s status.
A player enters or restarts the in-group punishment
phase for $k^{gp}$ periods by defecting against a cooperator.
In out-group pairings, play C if neither group is in an
out-group punishment phase. Otherwise, play D. A group
enters the out-group punishment phase for $k^{o}$ periods if
any member defects in a cross-group pairing when neither
group is in the out-group punishment phase.


EXERCISE 9.5. Consider the model of trade wars. Construct the
following “probabilistic grim trigger SPNE.” Instead of reverting to
protectionism forever the first time $\Theta_i = 0$ is observed, assume that
country $i$ plays a mixed strategy and protects forever with probability $\mu$.


CHAPTER 10 
Bargaining Theory 


If political science is the study of “who gets what, when, and
how” then bargaining theory lies at its foundation.¹ Legislators and
executives bargain over new legislation. States bargain to reach new
international agreements and to settle crises. Political parties bargain
over coalition governments. And so on.

Not surprisingly given its importance, the application of game the- 
oretic models of bargaining to study political processes has been a very 
active area of research. These models have focused on two sets of 
issues. The first are the questions of distribution — “who wins” and 
“who loses.” Does the president get his preferred legislation? Which 
country gets to control the disputed region? Which parties receive
government portfolios? The second important question concerns the 
efficiency of political bargaining. Does the bargaining process itself 
consume resources or fail to reach outcomes that make everyone better 
off? Does legislative bargaining end in gridlock or a veto even though 
there are policy compromises that all prefer? Do international dis- 
putes end in costly militarized conflicts and wars? Why does it take 
so long to form new coalition governments? 

In this chapter, we review some of the most important bargaining 
models and their application to political science. 


1. The Nash Bargaining Solution 


One of the earliest attempts to model bargaining was the framework 
developed by John Nash. His approach was axiomatic in that he stipu- 
lated a number of features that should characterize the outcome of any 
bargaining situation. Before discussing his axiomatic requirements, 
we describe his “solution” to the bargaining problem. Our discussion
closely mirrors that of Muthoo (1999).

Suppose that two players $A$ and $B$ are negotiating over the
allocation of $X$ units of some resource. We assume that $X$ is infinitely
divisible so that the feasible allocations are all $x_A$ and $x_B$ such that
$x_A + x_B \leq X$. Each player receives utility based on their allocations,
$U_A(x_A)$ and $U_B(x_B)$. We assume that $U_i$ is strictly increasing and
concave for both players $i = A, B$. In the event that no agreement
is reached, each player receives a default utility, disagreement value or
outside option of $\bar{u}_i \geq U_i(0)$. Finally, to ensure that the bargaining
problem is non-trivial, we assume that there exists at least one
allocation $(x_A, x_B)$ such that $U_i(x_i) > \bar{u}_i$ and $x_A + x_B \leq X$.
This ensures that there is a feasible allocation that both players prefer to
their disagreement values.


¹See Lasswell (1936).

In analyzing Nash’s solution to this problem, it is useful to convert it
into one of allocations of utilities $(u_A, u_B)$ rather than one of
allocations of $X$. Therefore, we define the feasible utility allocations as the
set $\Omega = \{(u_A, u_B) : U_A(x_A) = u_A, U_B(x_B) = u_B, \text{ and } x_A + x_B \leq X\}$.
Given our assumptions about the utility functions, the boundary of
this feasible set can be represented as a locus of points such as the
one in Figure 10.1. We define this locus as the function
$g(u_A) = U_B(X - U_A^{-1}(u_A))$. Muthoo (1999) provides a proof that $g$ is both
decreasing and concave in $u_A$. To simplify our exposition, we assume
that it is twice-differentiable.

Now we can state Nash’s solution to the bargaining problem. Based
on the axioms we discuss below, his solution is the utility allocation
$(u_A, u_B) \in \Omega$ that maximizes

$$(u_A - \bar{u}_A)(u_B - \bar{u}_B)$$

subject to $u_A \geq \bar{u}_A$ and $u_B \geq \bar{u}_B$.


The requirement that $u_A \geq \bar{u}_A$ and $u_B \geq \bar{u}_B$ is illustrated by
the dotted lines in Figure 10.1. Thus, the constraint set is $g(u_A)$ on the
range $[\bar{u}_A, g^{-1}(\bar{u}_B)]$. Since $g$ is concave and decreasing, the
feasible set is convex. It is easy to see that the Nash product
$(u_A - \bar{u}_A)(u_B - \bar{u}_B)$ is quasi-concave in both $u_A$ and $u_B$. Thus,
we can represent its level curves in the region where the product is positive
as the heavy dotted lines in Figure 10.1.


Insert Figure 10.1 Here 


Thus, there is a unique Nash bargaining solution at the tangency
of $g$ and the iso-product curves. Mathematically, the solution to the
constrained optimization problem is given by

$$-g'(u_A) = \frac{u_B - \bar{u}_B}{u_A - \bar{u}_A}$$

$$u_B = g(u_A)$$

Before moving to general results about the Nash bargaining solution,
it is useful to consider some special cases. First, assume that $X = 1$
and $U_i(x_i) = x_i$. The Nash bargaining solution for this model is

$$u_A = x_A = \frac{1 + \bar{u}_A - \bar{u}_B}{2} \quad\text{and}\quad u_B = x_B = \frac{1 - \bar{u}_A + \bar{u}_B}{2}$$

It is easy to see two important features of the solution. First, each
player does better when disagreement provides it with a higher utility
and worse when its opponent has a better outside option. Second,
if each player has an equally valuable outside option, the resources are
split evenly. Another way to interpret Nash’s solution is to note that
the bargainers insist upon their disagreement values and equally split
the surplus $1 - \bar{u}_A - \bar{u}_B$, which gives each a utility of
$\bar{u}_i + \frac{1 - \bar{u}_A - \bar{u}_B}{2}$.
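
The split-the-surplus reading of the linear case is easy to verify
numerically. The following sketch is illustrative only: the function names
are hypothetical, and the `total` argument generalizes the $X = 1$ case in the
text as an assumption. It compares the closed-form split with a brute-force
search over the Nash product.

```python
def nash_linear_split(u_bar_a, u_bar_b, total=1.0):
    """Each player keeps the disagreement value plus half of the surplus."""
    surplus = total - u_bar_a - u_bar_b
    if surplus <= 0:
        raise ValueError("no allocation makes both players better off")
    return u_bar_a + surplus / 2.0, u_bar_b + surplus / 2.0


def nash_product_argmax(u_bar_a, u_bar_b, total=1.0, steps=10_000):
    """Grid search for player A's share that maximizes the Nash product."""
    best_x, best_val = None, -1.0
    for n in range(steps + 1):
        x = total * n / steps
        gain_a = x - u_bar_a
        gain_b = (total - x) - u_bar_b
        if gain_a < 0 or gain_b < 0:
            continue  # outside the individually rational region
        val = gain_a * gain_b
        if val > best_val:
            best_x, best_val = x, val
    return best_x


if __name__ == "__main__":
    # Illustrative disagreement values only.
    print(nash_linear_split(0.2, 0.4))
    print(nash_product_argmax(0.2, 0.4))
```
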

Now we turn to the general case. First, we can state the Nash
bargaining solution in terms of shares.


PROPOSITION 10.1. The Nash bargaining shares are given by the
solution to

$$\frac{U_A(x_A) - \bar{u}_A}{U_A'(x_A)} = \frac{U_B(X - x_A) - \bar{u}_B}{U_B'(X - x_A)}$$

PROOF. Direct application of the previous result using the fact that

$$g'(u_A) = -U_B'\left(X - U_A^{-1}(u_A)\right) \cdot \left(U_A^{-1}\right)'(u_A) = -\frac{U_B'\left(X - U_A^{-1}(u_A)\right)}{U_A'\left(U_A^{-1}(u_A)\right)}$$

and $U_A^{-1}(u_A) = x_A$. □


A direct implication of this result is that if the disagreement values
and utility functions are the same for both players, the Nash bargaining
shares are $x_A = x_B = \frac{1}{2}X$. Finally, we show that given our
assumptions about $g$, payoffs increase in one’s own disagreement value and
decline in the opponent’s.


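PROPOSITION 10.2. Assume that $g$ is twice-differentiable. Then
$\frac{\partial u_i}{\partial \bar{u}_i} > 0$ and $\frac{\partial u_i}{\partial \bar{u}_j} < 0$ for $i \neq j$.


PROOF. Since $g$ is twice differentiable, we can apply implicit
differentiation to the solution
$\frac{g(u_A) - \bar{u}_B}{u_A - \bar{u}_A} + g'(u_A) = 0$. Since the second order
condition is satisfied, i.e.,
$\frac{g'(u_A)(u_A - \bar{u}_A) - (g(u_A) - \bar{u}_B)}{(u_A - \bar{u}_A)^2} + g''(u_A) < 0$,
the result follows from $-\frac{1}{u_A - \bar{u}_A} < 0$,
$\frac{g(u_A) - \bar{u}_B}{(u_A - \bar{u}_A)^2} > 0$, and $g'(u_A) < 0$. □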


1.1. Application: Risk Aversion and the Nash Bargaining
Solution. Intuitively, risk is an important component of bargaining.
Bargainers always have to contend with the possibility that an
agreement will not be reached and they will be left with their outside
options. Also we should expect that if a player makes a more aggressive
demand, she increases the probability that the negotiations will
collapse. Consequently, it seems natural to think that bargainers who
are more willing to tolerate risk should do better because they are
willing to make tougher demands and more aggressively reject offers. While
the Nash bargaining model “black boxes” the negotiation process, the
solution is consistent with this intuition.

To see this, assume that each player has a utility function given by
$U_i(x_i) = x_i^{\alpha_i}$ where $0 < \alpha_i < 1$, disagreement values
$\bar{u}_i = 0$, and $X = 1$. The different values of $\alpha$ capture the
players’ risk aversion: the lower $\alpha$, the greater the risk aversion.² It
is easiest to compute the equilibrium shares using the formula from
Proposition 10.1. The solution is

$$x_A = \frac{\alpha_A}{\alpha_A + \alpha_B} \quad\text{and}\quad x_B = \frac{\alpha_B}{\alpha_A + \alpha_B}$$

These results imply that each bargainer’s share is decreasing in their
own risk aversion and increasing in the risk aversion of their opponent.
This effect is consistent with our intuition that bargainers who are
risk-acceptant enough to take tough positions (i.e. increase the likelihood
of disagreement) should receive larger allocations.
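
The risk aversion result can be checked by maximizing the Nash product
directly. The sketch below is an illustration with hypothetical function
names: it assumes the power utilities, zero disagreement values, and $X = 1$
used in the text, and compares the closed-form share with a simple grid
search.

```python
def nash_share_power(alpha_a, alpha_b):
    """Closed-form share of player A with utilities x**alpha_i."""
    return alpha_a / (alpha_a + alpha_b)


def nash_share_grid(alpha_a, alpha_b, steps=100_000):
    """Grid search for the share of A that maximizes the Nash product."""
    best_x, best_val = 0.0, -1.0
    for n in range(steps + 1):
        x = n / steps
        # Disagreement values are zero, so the product is U_A(x) * U_B(1 - x).
        val = (x ** alpha_a) * ((1.0 - x) ** alpha_b)
        if val > best_val:
            best_x, best_val = x, val
    return best_x


if __name__ == "__main__":
    # Illustrative exponents only: A is more risk averse than B.
    print(nash_share_power(0.5, 1.0))
    print(nash_share_grid(0.5, 1.0))
```

Lowering a player’s exponent, which makes that player more risk averse,
lowers the share returned by both functions.
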


1.2. Nash’s Axioms. In this section we outline the axioms that
underlie Nash’s bargaining solution. Informally, the axioms are
intended to encapsulate the following principles.


(1) The bargainers are expected utility maximizers.

(2) Bargaining should be efficient. The players should fully
allocate all of the available resources and no player should do
worse than their disagreement value.

(3) The allocation should depend only on the players’ preferences
and disagreement values.

(4) The bargaining solution should not be affected by eliminating
from consideration allocations other than the solution.


To formalize these axioms, recall that $\Omega$ is the set of feasible
utility allocations $(u_A, u_B)$ that can be reached through some
allocation of $X$. We now define the set of Pareto optimal allocations as
$\Omega^{e} = \{u \in \Omega : u_A \geq \bar{u}_A \text{ and } g(u_A) \geq \bar{u}_B\}$.
We can define a generic bargaining situation as a pair $(\Omega, \bar{u})$ where
$\bar{u}$ is the vector of disagreement values. We denote the set of all
bargaining games as $\Sigma$ and the bargaining solution as a correspondence
$F : \Sigma \to \mathbb{R}^2$. We let $F_i$ denote the utility allocated to
agent $i$.

The following axioms form the basis of Nash’s solution.


²A standard measure of risk aversion is $-\frac{u''}{u'}$. For these utility
functions, $-\frac{u''}{u'} = -\frac{\alpha(\alpha-1)x^{\alpha-2}}{\alpha x^{\alpha-1}} = \frac{1-\alpha}{x}$.


AXIOM 10.1. Invariance to equivalent utility representations: Let
$U_i' = \alpha_i U_i + \beta_i$ and $\bar{u}_i' = \alpha_i \bar{u}_i + \beta_i$ for
$\alpha_i > 0$ and define $\Omega'$ accordingly.
Then $F_i(\Omega', \bar{u}') = \alpha_i F_i(\Omega, \bar{u}) + \beta_i$ for $i = A, B$.


Affine transformations of utility functions and disagreement utilities
should not alter the bargaining outcomes. Since the utility allocations
are adjusted by the same transformations as the utility functions, it
is easy to show that the resource allocations are the same in the two
situations $(\Omega, \bar{u})$ and $(\Omega', \bar{u}')$ for $i = A, B$. As we saw
in chapter 3, this axiom implies that the players are expected utility
maximizers.


AXIOM 10.2. Pareto efficiency: If $F(\Omega, \bar{u}) = (u_A, u_B)$, then there
are no other allocations $(u_A', u_B') \in \Omega$ such that $u_i' > u_i$ for
some $i$, $u_j' \geq u_j$ for $j \neq i$, and $u_i' \geq \bar{u}_i$ for all $i$.


The Pareto axiom holds that the bargainers should not be able to 
improve upon the bargaining solution by choosing an allocation that 
makes one of the bargainers better off without reducing the utility of 
the other. 


AXIOM 10.3. Symmetry: Let $\bar{u}_A = \bar{u}_B$ and assume that
$(u_1, u_2) \in \Omega$ if and only if $(u_2, u_1) \in \Omega$. Then
$F_A(\Omega, \bar{u}) = F_B(\Omega, \bar{u})$.


The basic idea of this axiom is that if neither player is advantaged 
by having a better disagreement outcome or a utility level unreach- 
able by her opponent, then the bargainers should get the same utility 
allocations. 


AXIOM 10.4. Independence of Irrelevant Alternatives: Consider
two bargaining situations $(\Omega, \bar{u})$ and $(\Omega', \bar{u})$ such that
$\Omega' \subset \Omega$ and $F(\Omega, \bar{u}) \in \Omega'$. Then
$F(\Omega', \bar{u}) = F(\Omega, \bar{u})$.


The intuition behind the IIA axiom is that, holding the disagree- 
ment points constant, a smaller feasible set of allocations should only 
change the bargaining solution if it makes the original allocation infea- 
sible. 

From our analysis of the Nash bargaining solution in the previous 
section, it is clear that it satisfies all of these axioms. However, the 
next proposition establishes that it is the only solution which satisfies 
all four axioms. 


PROPOSITION 10.3. A bargaining solution $F : \Sigma \to \mathbb{R}^2$ satisfies
axioms 1-4 if and only if it is the Nash bargaining solution.


PROOF. See Muthoo (1999). □


2. Non-cooperative Bargaining


While it does make a number of reasonable empirical predictions, 
the Nash bargaining solution is best interpreted as a normative argu- 
ment about what bargaining outcomes should look like rather than a 
positive theory about how actual bargaining will take place. In this 
section, we turn to non-cooperative game theoretic models which de- 
duce behavior of bargainers under different extensive forms. 

The starting point for the application of non-cooperative game
theory to bargaining is the model of Rubinstein (1982). Suppose that
two players are trying to decide how to divide one dollar. The players will
take turns making offers so that player 1 proposes in periods 0, 2, 4, etc.
and player 2 makes proposals in the other periods. The game continues
(possibly infinitely) until a proposal is accepted by the other player.

In each period that she is the proposer, player 1 can make an offer
$(x_1, x_2)$ where $x_1$ is player 1’s share and $x_2$ is player 2’s share where
$x_1 + x_2 \leq 1$. If player 2 accepts, the game ends and the dollar is
divided accordingly. If player 2 rejects, then she gets to make an offer
$(x_1, x_2)$ and the game continues if player 1 rejects. To simplify matters,
we assume that both players have linear utility functions $u_1 = x_1$ and
$u_2 = x_2$. Each player has a discount factor $\delta_i$ so that players value a
proposal of $(x_1, x_2)$ $t$ periods in the future as
$(\delta_1^t x_1, \delta_2^t x_2)$.

Just as in the bargaining game we encountered in chapter 7, there
are lots of Nash equilibria to this game. For example, consider the
strategies “Player 1 demands $x_1 = 1$ and refuses all other offers, while
player 2 always offers $x_1 = 1$ and accepts any offer”. However, this
equilibrium is not subgame perfect. If player 2 rejected player 1’s first
offer, and offered $x_1 > \delta_1$, player 1 should accept because the best she
can get is the whole dollar next period. So we will focus on subgame
perfect Nash equilibrium.


2.1. Subgame Perfect Equilibria. Rubinstein shows that there
is a unique SPNE to this game based on playing the following strategies
in every period:


Player 1 proposes
$\left(\frac{1-\delta_2}{1-\delta_1\delta_2}, \frac{\delta_2(1-\delta_1)}{1-\delta_1\delta_2}\right)$
and accepts player 2’s offer if and only if
$x_1 \geq \frac{\delta_1(1-\delta_2)}{1-\delta_1\delta_2}$.

Player 2 proposes
$\left(\frac{\delta_1(1-\delta_2)}{1-\delta_1\delta_2}, \frac{1-\delta_1}{1-\delta_1\delta_2}\right)$
and accepts player 1’s offer if and only if
$x_2 \geq \frac{\delta_2(1-\delta_1)}{1-\delta_1\delta_2}$.


We begin by simply verifying that these strategies are in fact a
SPNE. First we check whether player 1 has an incentive to defect in
any subgame. Consider a subgame beginning with a proposal by player
1 (i.e. an even period). In the equilibrium, player 1 proposes the split
$\left(\frac{1-\delta_2}{1-\delta_1\delta_2}, \frac{\delta_2(1-\delta_1)}{1-\delta_1\delta_2}\right)$
which is accepted by player 2. Clearly, player 1 cannot
gain by lowering $x_1$ as it will be accepted but she gets a lower share.
If player 1 raises $x_1$, then she must lower $x_2$ so that the proposal is
feasible. However, any $x_2 < \frac{\delta_2(1-\delta_1)}{1-\delta_1\delta_2}$ will be
rejected. Following such a rejection, player 2 proposes
$x_1 = \frac{\delta_1(1-\delta_2)}{1-\delta_1\delta_2}$ which player 1 accepts.
Thus, player 1’s utility of the defection is
$\frac{\delta_1^2(1-\delta_2)}{1-\delta_1\delta_2}$ which is less than
her equilibrium utility of $\frac{1-\delta_2}{1-\delta_1\delta_2}$ since $\delta_1 < 1$.

Now consider whether player 1 will defect when player 2 is the
proposer (i.e. an odd period). Player 2 proposes
$\left(\frac{\delta_1(1-\delta_2)}{1-\delta_1\delta_2}, \frac{1-\delta_1}{1-\delta_1\delta_2}\right)$.
Note that accepting and rejecting the offer lead to the same utility as
the best that player 1 can do is have $x_1 = \frac{1-\delta_2}{1-\delta_1\delta_2}$
accepted one period later.


Showing that player 2 will not defect is entirely similar.


2.2. Computing the Equilibrium. The problem with the
preceding proof is that it does not give much of a sense of how the result
is derived. Now we consider a more constructive proof. Let $v_1$ and
$v_2$ be the utilities of player 1 and 2 for subgames in which they are the
proposer. For example, if player 1 makes a proposal $x_1$ that is accepted,
$v_1 = x_1$. If player 1’s proposal is rejected, $v_1$ is the discounted value
of the maximum of what player 2 offers and what she gets by rejecting
and proposing in her next turn. Given that the postulated strategies
are the same in every period, these values are independent of $t$. We
will call these continuation values since they also reflect the utility of
rejecting a proposal and moving to the next subgame. Consider a
subgame where player 1 is the proposer. She must offer player 2 at
least $\delta_2 v_2$. Thus, $x_1 = 1 - \delta_2 v_2$. Since this offer is accepted,
$v_1 = x_1$ so that $v_1 = 1 - \delta_2 v_2$. Consider a subgame where player 2 is
the proposer. She must offer at least $\delta_1 v_1$ so that
$v_2 = 1 - \delta_1 v_1$.

Solving these two equations leads to

$$v_1 = \frac{1-\delta_2}{1-\delta_1\delta_2}$$

$$v_2 = \frac{1-\delta_1}{1-\delta_1\delta_2}$$

These continuation values are consistent with the strategies presented 
in the last section. 
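
The two continuation-value equations can be checked with a few lines of
code. This is a minimal sketch with hypothetical function names: it returns
the proposer values from the closed form and the first-period split that
they imply.

```python
def rubinstein_values(delta_1, delta_2):
    """Proposer continuation values (v_1, v_2) in the alternating-offer game."""
    denom = 1.0 - delta_1 * delta_2
    return (1.0 - delta_2) / denom, (1.0 - delta_1) / denom


def first_period_split(delta_1, delta_2):
    """Shares (x_1, x_2) proposed by player 1 and accepted in period 0."""
    v_1, v_2 = rubinstein_values(delta_1, delta_2)
    # Player 2 is offered exactly the discounted value of proposing next period.
    return v_1, delta_2 * v_2


if __name__ == "__main__":
    # Illustrative discount factors only.
    for d1, d2 in [(0.9, 0.9), (0.8, 0.5), (0.99, 0.99)]:
        v_1, v_2 = rubinstein_values(d1, d2)
        # Both recursive equations should hold at the solution.
        assert abs(v_1 - (1.0 - d2 * v_2)) < 1e-12
        assert abs(v_2 - (1.0 - d1 * v_1)) < 1e-12
        print((d1, d2), first_period_split(d1, d2))
```

With equal discount factors close to one the printed split approaches an
even division, and a more patient player 1 keeps a larger share.
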


2.3. Uniqueness. While we have shown that Rubinstein’s
equilibrium is a subgame perfect Nash equilibrium, we have not ruled out
the possibility that there are others. We now show that this
equilibrium is the unique SPNE by proving that $v_1$ and $v_2$ above are the only
continuation values consistent with a SPNE. Suppose there is more
than one SPNE. Let $\bar{v}_i$ and $\underline{v}_i$ be player $i$’s highest and
lowest SPNE continuation values for any subgame where player $i$ is the
proposer. Let $\bar{w}_i$ and $\underline{w}_i$ be player $i$’s highest and lowest
SPNE continuation values for any subgame where player $i$ is not the proposer.

When player 1 makes a proposal, she never needs to offer more
than $\delta_2\bar{v}_2$ since player 2 cannot expect more than $\bar{v}_2$ by
rejecting and making her own proposal in the next round. Thus, her lowest
possible continuation value must satisfy
$\underline{v}_1 \geq 1 - \delta_2\bar{v}_2$. By the symmetric
argument, $\underline{v}_2 \geq 1 - \delta_1\bar{v}_1$. Since we now know that the
other player will never offer anything greater than $\delta_i\bar{v}_i$, we know
that $\bar{w}_i \leq \delta_i\bar{v}_i$.

Now consider player 1’s strategy. When she proposes, the best she
can do is either to pay $\delta_2\underline{v}_2$ or trigger a rejection to get
$\delta_1\bar{w}_1$. Thus, we know that her continuation value satisfies
$\bar{v}_1 \leq \max\{1 - \delta_2\underline{v}_2, \delta_1\bar{w}_1\} \leq \max\{1 - \delta_2\underline{v}_2, \delta_1^2\bar{v}_1\} = 1 - \delta_2\underline{v}_2$,
where the last equality holds because $\bar{v}_1 \leq \delta_1^2\bar{v}_1$ is
impossible when $\bar{v}_1 > 0$ and $\delta_1 < 1$. Similarly,
$\bar{v}_2 \leq 1 - \delta_1\underline{v}_1$. Thus, we
have the following four inequalities

$$\underline{v}_1 \geq 1 - \delta_2\bar{v}_2$$
$$\underline{v}_2 \geq 1 - \delta_1\bar{v}_1$$
$$\bar{v}_2 \leq 1 - \delta_1\underline{v}_1$$
$$\bar{v}_1 \leq 1 - \delta_2\underline{v}_2$$

Using the first and third inequalities, we see that
$\underline{v}_1 \geq 1 - \delta_2(1 - \delta_1\underline{v}_1)$
which implies that $\underline{v}_1 \geq \frac{1-\delta_2}{1-\delta_1\delta_2}$.
Similarly, using the second and fourth,
we get $\bar{v}_1 \leq 1 - \delta_2(1 - \delta_1\bar{v}_1)$ or
$\bar{v}_1 \leq \frac{1-\delta_2}{1-\delta_1\delta_2}$. This implies that
$\bar{v}_1 = \underline{v}_1 = \frac{1-\delta_2}{1-\delta_1\delta_2}$. Similarly, we can
derive that $\bar{v}_2 = \underline{v}_2 = \frac{1-\delta_1}{1-\delta_1\delta_2}$. Thus,
there is a single continuation value for each player. Thus, the postulated
strategies are the only SPNE.
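
One way to see why the four inequalities pin down a single value is to apply
them repeatedly, starting from the trivial bounds that every continuation
value lies between 0 and 1. The sketch below is only an illustration of that
squeezing argument, not part of the proof; the function name and the number
of rounds are assumptions.

```python
def tighten_bounds(delta_1, delta_2, rounds=200):
    """Iterate the four inequalities on (lowest, highest) proposer values."""
    lo_1 = lo_2 = 0.0  # trivial lower bounds on v_1 and v_2
    hi_1 = hi_2 = 1.0  # trivial upper bounds on v_1 and v_2
    for _ in range(rounds):
        # All four updates use the bounds from the previous round.
        lo_1, lo_2, hi_2, hi_1 = (
            1.0 - delta_2 * hi_2,  # lowest v_1 >= 1 - delta_2 * highest v_2
            1.0 - delta_1 * hi_1,  # lowest v_2 >= 1 - delta_1 * highest v_1
            1.0 - delta_1 * lo_1,  # highest v_2 <= 1 - delta_1 * lowest v_1
            1.0 - delta_2 * lo_2,  # highest v_1 <= 1 - delta_2 * lowest v_2
        )
    return (lo_1, hi_1), (lo_2, hi_2)


if __name__ == "__main__":
    d1, d2 = 0.8, 0.5  # illustrative discount factors only
    print(tighten_bounds(d1, d2))
    print((1 - d2) / (1 - d1 * d2), (1 - d1) / (1 - d1 * d2))
```

The lower and upper bounds collapse to the closed-form values because each
pair of updates shrinks the gap by the factor $\delta_1\delta_2 < 1$.
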


2.4. Implications. The model suggests a very simple path of play.
In period zero, player 1 proposes
$\left(\frac{1-\delta_2}{1-\delta_1\delta_2}, \frac{\delta_2(1-\delta_1)}{1-\delta_1\delta_2}\right)$,
player 2 accepts, and
the game ends. Since the whole dollar is allocated and there is no delay,
the subgame perfect Nash equilibrium is efficient. It is easy to see that
the SPNE has the following implications.


(1) If both players have the same discount factor, there is a first
mover advantage since
$\frac{1-\delta}{1-\delta^2} > \frac{\delta(1-\delta)}{1-\delta^2}$. Intuitively,
since player 2 discounts the future, player 1 only needs to offer her a
fraction of what she would get for being the proposer next period. Since
both players are identical, this means that player 2 is getting
only a fraction of what player 1 gets.

(2) Both players’ shares are increasing in their own discount factors and
declining in their opponent’s. It pays to be patient. When
player 2’s discount factor is high, player 1 has to offer her
more to secure immediate agreement. Conversely, when player
1’s discount factor is high, player 2 will have to offer her more
to reach agreement in the event that player 2 gets to make an
offer. Thus, rejecting player 1’s offer is less valuable for player
2, suggesting that player 1 gets to keep more in the first period.

(3) Let $\delta_1 = \delta_2 = \delta$; then both players’ shares converge to
$\frac{1}{2}$ as $\delta$ converges to 1. As both players become perfectly patient,
they are less willing to accept offers that are less than what
they can get as the proposer next period. In the limit, they
demand exactly what they expect to get next period which
is satisfied by the proposal $\left(\frac{1}{2}, \frac{1}{2}\right)$. One way to
think about the discount factors converging to one is to consider a situation
in which offers and counter-offers can be made very quickly
so that rejecting an offer creates only infinitesimal delay. In
such a case, the equilibrium is equal division and corresponds
exactly to the Nash bargaining solution for this problem.


2.5. Asymmetric Disagreement Values. In the canonical
Rubinstein game, the players get 0 in any period for which there is no
agreement. We now modify the game in two ways. First, we assume
that the players receive an allocation of $(d_1, d_2)$ in each period prior
to an agreement where $d_1 + d_2 < 1$. Second, after an agreement $(x_1, x_2)$
is reached, the bargainers receive this allocation in every period over
an infinite horizon. This contrasts with the model of the last
section where the allocation is “consumed immediately.”³ To keep things
simple, we assume that $\delta_1 = \delta_2 = \delta$. Thus, the utilities of reaching
agreement $(x_1, x_2)$ in period $t$ are⁴

$$\left(\frac{(1-\delta^{t})d_1 + \delta^{t}x_1}{1-\delta}, \frac{(1-\delta^{t})d_2 + \delta^{t}x_2}{1-\delta}\right)$$


Let $v_i$ be the continuation values for periods in which $i$ proposes. If an agreement $(x_1, x_2)$ is reached in such a period, $v_i = \frac{x_i}{1-\delta}$. Consider player 2’s decision to accept or reject an offer of $x_2$. If she accepts, she gets a value of $\frac{x_2}{1-\delta}$, whereas if she rejects she gets $d_2$ in the current period and a continuation value $v_2$ in the next. Thus, she will accept so long as $x_2 \geq (1-\delta)(d_2 + \delta v_2)$. Now consider player 1’s choice. If he makes the minimal acceptable offer $x_2 = (1-\delta)(d_2 + \delta v_2)$, his continuation value is $v_1 = \frac{1 - (1-\delta)(d_2 + \delta v_2)}{1-\delta} = \frac{1}{1-\delta} - d_2 - \delta v_2$. Similarly, assuming that player 2 wishes to secure an agreement with her proposals, we require $v_2 = \frac{1}{1-\delta} - d_1 - \delta v_1$. The solution to these two equations is given by

$$v_1 = \frac{1 - d_2 + \delta d_1}{1-\delta^2} = \frac{d_1}{1-\delta} + \frac{1 - d_1 - d_2}{1-\delta^2}$$

$$v_2 = \frac{1 - d_1 + \delta d_2}{1-\delta^2} = \frac{d_2}{1-\delta} + \frac{1 - d_1 - d_2}{1-\delta^2}$$


3This modification rules out strategies where the bargainers delay infinitely in the hopes that the discounted sum of $d_i$ exceeds the one-period agreement. We can easily adjust the original model to correspond to the assumption that the agreement is over a flow of utilities rather than one-shot consumption. We would simply use the original model and assume that the players were allocating $\frac{1}{1-\delta}$.

4We assume that any agreement results in the same allocation in each period. However, since the players are risk neutral, there might be agreements to random allocations that generate the same payoffs.


To show that these are in fact equilibrium continuation values, we must show that each player prefers to make their equilibrium proposal rather than defect and get their disagreement value for an additional period. Thus, we require $v_1 \geq d_1(1+\delta) + \delta^2 v_1$, or $v_1 \geq \frac{d_1}{1-\delta}$, which is easily verified since $d_1 + d_2 < 1$. Similarly, our equilibrium requires that $v_2 \geq \frac{d_2}{1-\delta}$, which is also satisfied. The techniques of Section 2.3 can easily be generalized to show that this is the unique SPNE.

This equilibrium has a number of qualitative similarities to the Nash bargaining solution. Note that each player’s continuation value increases in her own disagreement value and decreases in her opponent’s. This equilibrium also has a surplus-splitting interpretation. Note that each player’s continuation value has two components. The first is $\frac{d_i}{1-\delta}$, which is the utility each player can guarantee herself in the absence of any agreement. The second component, $\frac{1-d_1-d_2}{1-\delta^2}$, corresponds to the equilibrium continuation value in a game to split $1 - d_1 - d_2$ when the players have outside option values of 0. Thus, a useful interpretation of this equilibrium is that both players take what they are entitled to and bargain over the rest.
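
The continuation values of this section can be checked numerically. The following Python sketch is illustrative only: the function names and the parameter values ($d_1 = 0.2$, $d_2 = 0.1$, $\delta = 0.9$) are assumptions chosen for the example and do not appear in the text. It evaluates the closed-form continuation values, confirms that they satisfy the two equilibrium equations, and splits a value into its guaranteed and bargained components.

```python
def continuation_values(d1, d2, delta):
    """Proposer continuation values (v1, v2) with per-period disagreement payoffs.

    Assumes 0 < delta < 1 and d1 + d2 < 1, as in the text.
    """
    denom = 1.0 - delta ** 2
    v1 = (1.0 - d2 + delta * d1) / denom
    v2 = (1.0 - d1 + delta * d2) / denom
    return v1, v2


def equation_residuals(d1, d2, delta):
    """Residuals of v1 = 1/(1-delta) - d2 - delta*v2 and its mirror image."""
    v1, v2 = continuation_values(d1, d2, delta)
    total = 1.0 / (1.0 - delta)  # value of receiving the whole dollar forever
    return (v1 - (total - d2 - delta * v2),
            v2 - (total - d1 - delta * v1))


def decompose(d_own, d_other, delta):
    """Split a continuation value into its guaranteed and bargained parts."""
    guaranteed = d_own / (1.0 - delta)
    bargained = (1.0 - d_own - d_other) / (1.0 - delta ** 2)
    return guaranteed, bargained


if __name__ == "__main__":
    d1, d2, delta = 0.2, 0.1, 0.9  # hypothetical parameter values
    print("v1, v2:", continuation_values(d1, d2, delta))
    print("residuals:", equation_residuals(d1, d2, delta))
    print("player 1 components:", decompose(d1, d2, delta))
```

Both residuals should be zero up to floating-point error, and the two components returned by `decompose` should sum to the corresponding continuation value.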


3. Majority Rule Bargaining Under Closed Rule 


A key feature of the Rubinstein model is that unanimous consent 
is required to reach an agreement on the allocation. This rules out a 
number of important political settings where only a simple or super- 
majority is required for agreement. Baron and Ferejohn (1989) have 
extended Rubinstein’s model to simple majority rule. 

Suppose that there are $N$ (odd) players bargaining and any proposal requires $n = (N+1)/2$ votes. Instead of assuming alternating offers, Baron and Ferejohn consider a bargaining protocol with a Random Recognition Rule. According to this protocol, in each period, every player has an equal probability $(1/N)$ of being chosen to be the proposer. In this section, we focus on bargaining under closed rule where the proposer makes a take-it-or-leave-it offer for the current legislative session. The proposer in each period makes an offer $(x_1, x_2, \ldots, x_N)$ such that $x_i$ is the share for player $i$ and we require $\sum_i x_i \leq 1$. If this proposal is rejected, the session ends, discounting occurs, and a new proposer is chosen at the beginning of the next session. In a later section, we consider open rule bargaining where proposals can be amended within the current session. To keep things simple, we assume that each player has the same discount factor $\delta$.

This game has many subgame perfect equilibria. In fact, for large $N$ and $\delta$, there is an SPNE that can support any division of the dollar. This is due to the fact that if the players are patient enough, they can design punishment strategies to guarantee a payoff of zero to any defector. However, these strategies require that each player know the whole history (possibly infinite) of the game so as to know which actions are consistent with the prescribed punishment. Thus, following Baron and Ferejohn, we will analyze only stationary equilibria. A stationary equilibrium to this game is one in which:


(1) A proposer proposes the same division every time she is recog- 
nized regardless of the history of the game. 

(2) Voters vote only on the basis of the current proposal and ex- 
pectations about future proposals. Because of assumption 1, 
future proposals will have the same distribution of outcomes 
in each period. 


These two assumptions imply that the game essentially starts over in every period. Therefore, the continuation value of each player is exactly the expected utility of the game. Let $v_i$ be the continuation value for player $i$. We will focus on symmetric equilibria so that $v_i = v$ for all $i$. Finally, we will focus only on equilibria in which voters do not choose weakly dominated strategies in the voting stage. Therefore, a voter will accept any proposal that provides at least as much as the discounted continuation value: any voter who gets $x_i \geq \delta v$ will vote in favor of the proposal while any voter who receives less than $\delta v$ will vote against.

Given these voting strategies, a proposer knows that she must propose $\delta v$ to $n - 1$ other players and 0 to the rest. Let $z$ be the amount that the proposer keeps so that

$$z = 1 - (n-1)\delta v$$

We will assume (although we can show that it must be true) that the proposer chooses her coalition partners randomly. Now we can compute $v$. Since the continuation value is just the expected value of the game starting next period, it is simply $z$ times the probability of being chosen as proposer, plus $\delta v$ times the probability of being included in the winning coalition when not proposing, plus 0 times the remaining probability. Thus,

$$v = \frac{z}{N} + \frac{n-1}{N}\delta v.$$

Substituting for $z$ we obtain

$$v = \frac{1}{N}.$$


Thus, the continuation value is just a proportional share of the dollar. 
Since v is also the expected utility of the game, this result implies that 
bargaining is efficient in the sense that the sum of player utilities is 
maximized. As the reader will discover in the exercises, this efficiency 
result may not hold if voters are risk-averse. 

Finally, given our solution for $v$, we can compute the proposer’s share:

$$z = 1 - \frac{\delta(n-1)}{N} = 1 - \frac{\delta(N-1)}{2N}$$

To ensure that the proposer will prefer to make an acceptable proposal, we must check that $z \geq \delta v$; otherwise a proposer would prefer to punt and wait for the next period. This condition is easily verified.

Among the important implications of the model are its predictions about proposal power. First, note that proposal power, measured by the proposer’s premium over the proportional share, $z - \frac{1}{N} = \frac{(N-1)(2-\delta)}{2N}$, is increasing in $N$. When $N$ increases, the proposer has more potential coalition partners to play off of one another. This increases the competition for inclusion in the winning coalition and drives down what the proposer must pay each partner. Secondly, proposal power is decreasing in $\delta$. When $\delta$ is higher, the voters are more willing to vote down proposals and wait for a chance to propose themselves. Thus, the proposer must be relatively more generous to secure agreement.

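
The closed-rule equilibrium is simple enough to compute directly. The Python sketch below is an illustration under the assumptions of this section (symmetric stationary equilibrium, odd $N$, common discount factor); the function names and the sample values of $N$ and $\delta$ are hypothetical. It returns the continuation value and the proposer's share, and checks that the pair satisfies the value equation.

```python
def closed_rule_equilibrium(N, delta):
    """Stationary symmetric equilibrium of the closed-rule game.

    N is the (odd) number of legislators and delta the common discount factor.
    Returns (v, z): the continuation value and the proposer's share.
    """
    if N % 2 == 0:
        raise ValueError("N must be odd")
    n = (N + 1) // 2          # votes needed under simple majority
    v = 1.0 / N               # continuation value derived in the text
    z = 1.0 - (n - 1) * delta * v
    return v, z


def value_equation_residual(N, delta):
    """Check v = z/N + ((n-1)/N) * delta * v at the computed solution."""
    v, z = closed_rule_equilibrium(N, delta)
    n = (N + 1) // 2
    return v - (z / N + (n - 1) / N * delta * v)


if __name__ == "__main__":
    for N in (3, 5, 101):     # hypothetical legislature sizes
        v, z = closed_rule_equilibrium(N, delta=0.9)
        print(N, "v =", v, "z =", z, "z - delta*v =", z - 0.9 * v)
```

The last printed quantity is the slack in the condition $z \geq \delta v$, which should be positive whenever the proposer prefers to make an acceptable proposal.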


3.1. Supermajority Rule. We can also easily extend the model to capture situations where more than a simple majority is required for passage of the bill. Now assume that $k > n$ votes are required. It is easy to see that the proposer’s share is now

$$z = 1 - (k-1)\delta v$$

and the continuation values are now given by

$$v = \frac{z}{N} + \frac{k-1}{N}\delta v.$$

Simple algebra reveals that once again $v = \frac{1}{N}$. This is not terribly surprising given that the supermajority rule preserves the same symmetry we found in the majority rule game. However, the proposer’s equilibrium share is now lowered to $z = 1 - \frac{\delta(k-1)}{N}$. Thus, the primary consequence of supermajority rules is to mitigate the proposer’s advantage.

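
As a purely illustrative calculation (the values $N = 5$ and $\delta = 0.9$ are assumptions chosen for the example, not taken from the text), the proposer's share under each voting rule from simple majority to unanimity follows from $z = 1 - \delta(k-1)/N$ with $v = 1/N$:

```latex
\begin{align*}
k = 3: \quad z &= 1 - \frac{0.9 \cdot 2}{5} = 0.64 \\
k = 4: \quad z &= 1 - \frac{0.9 \cdot 3}{5} = 0.46 \\
k = 5: \quad z &= 1 - \frac{0.9 \cdot 4}{5} = 0.28
\end{align*}
```

In each case every coalition partner receives $\delta v = 0.18$, so each additional required vote costs the proposer the same amount.
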


3.2. Asymmetric Proposal Power. A limitation of the preced- 
ing model is that it assumes that all legislators have the same prob- 
ability of being recognized to make the proposal. This assumption 
would ignore real world legislative institutions such as committees and 
parties which may affect the probability that an individual legislator 
gets to make a proposal. 

To show how the model generalizes, suppose that the members are divided into two parties $A$ and $B$. Party $A$ has $N - m \geq n$ members so that it is the majority. Each member of $A$ has proposal power $p > 1/N$, while the $m$ members of $B$ have proposal power $q < 1/N$. For consistency, we require that $(N-m)p + mq = 1$.

Again we will assume symmetry so that every legislator with the same recognition probability plays the same strategies and therefore has the same continuation value. The members of the two parties have continuation values $v_A$ and $v_B$, respectively. We conjecture for now (and verify later) that $v_A > v_B$. Given these continuation values, a member of party $A$ will vote for any proposal that provides her at least $\delta v_A$ and a member of party $B$ will vote for a proposal giving at least $\delta v_B$. Given these strategies and the assumption that $v_A > v_B$, a proposer from party $A$ will give $\delta v_B$ to the $m$ members of party $B$ and $\delta v_A$ to $n - m - 1$ members of party $A$. Thus,

$$z_A = 1 - (n-m-1)\delta v_A - m\delta v_B$$

A member of $B$ will give positive allocations to $m - 1$ members of $B$ and $n - m$ members of $A$ so that

$$z_B = 1 - (n-m)\delta v_A - (m-1)\delta v_B$$

Note that $z_A > z_B$. We can now compute $v_A$ and $v_B$:

$$v_A = p z_A + p(n-m-1)\delta v_A + \frac{q m (n-m)}{N-m}\delta v_A$$

$$v_B = q z_B + (1-q)\delta v_B$$

Thus, we have 4 equations with 4 unknowns. The system is straightforward to solve, but it’s messy, so we will consider a simple example. Let $N = 3$ and $m = 1$. Note that $q = 1 - 2p < 1/3$. Therefore, the equilibrium conditions are the following:

$$z_A = 1 - \delta v_B$$
$$z_B = 1 - \delta v_A$$
$$v_A = p z_A + \frac{q \delta v_A}{2}$$
$$v_B = q z_B + (1-q)\delta v_B.$$

After some tedious algebra, we find that

$$v_A = \frac{(1-q)(1-\delta)}{2 + q\delta - 2\delta}$$
$$v_B = \frac{q(2-\delta)}{2 + q\delta - 2\delta}$$

We still need to check our assumption that $v_A > v_B$. This occurs when

$$q < \frac{1-\delta}{3-2\delta}.$$

Since this upper bound is always less than 1/3 when $\delta > 0$, the asymmetry in proposal power must be substantial to give an advantage to party $A$. The reason is that its greater proposal power makes members of $A$ unattractive coalition partners. Thus, the likelihood of being the proposer must be large enough to offset this effect. However, it is easy to show that $v_A$ is decreasing and $v_B$ is increasing in $q$.
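
The "tedious algebra" for the $N = 3$, $m = 1$ example can be cross-checked numerically. The sketch below is a hedged illustration: the function names and the sample values of $q$ and $\delta$ are assumptions, and the linear system is simply the four equilibrium conditions with $z_A$ and $z_B$ substituted out. It solves the system by Cramer's rule and compares the result with the closed forms.

```python
def solve_values(q, delta):
    """Solve the N = 3, m = 1 system for (v_A, v_B) by Cramer's rule.

    After substituting z_A = 1 - delta*v_B and z_B = 1 - delta*v_A, the two
    value equations are linear in (v_A, v_B):
        (1 - q*delta/2) * v_A + p*delta * v_B           = p
        q*delta * v_A         + (1 - (1-q)*delta) * v_B = q
    """
    p = (1.0 - q) / 2.0       # recognition probability of each A member
    a11, a12, b1 = 1.0 - q * delta / 2.0, p * delta, p
    a21, a22, b2 = q * delta, 1.0 - (1.0 - q) * delta, q
    det = a11 * a22 - a12 * a21
    v_a = (b1 * a22 - a12 * b2) / det
    v_b = (a11 * b2 - a21 * b1) / det
    return v_a, v_b


def closed_form(q, delta):
    """Closed-form continuation values for the same example."""
    denom = 2.0 + q * delta - 2.0 * delta
    return (1.0 - q) * (1.0 - delta) / denom, q * (2.0 - delta) / denom


def conjecture_holds(q, delta):
    """True when q is below the bound under which v_A > v_B."""
    return q < (1.0 - delta) / (3.0 - 2.0 * delta)


if __name__ == "__main__":
    q, delta = 0.1, 0.5       # hypothetical parameter values
    print("numerical:  ", solve_values(q, delta))
    print("closed form:", closed_form(q, delta))
    print("v_A > v_B conjecture consistent:", conjecture_holds(q, delta))
```

When `conjecture_holds` returns `False`, the solution of this system has $v_A \leq v_B$, so the conjectured strategies are no longer mutually consistent and a different equilibrium must be sought.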

To complete our analysis, we need to consider what happens when $q \geq \frac{1-\delta}{3-2\delta}$. We can rule out $v_B > v_A$ as this would imply that the member of $B$ is never in a coalition with the proposer. In that case a member of $A$ who is not recognized is included by the other member of $A$ with certainty and by the member of $B$ with probability $\frac{1}{2}$, for a total probability of $p + \frac{q}{2} = \frac{1}{2}$. Thus,

$$v_B = q z_B = q(1 - \delta v_A)$$
$$v_A = p z_A + \tfrac{1}{2}\delta v_A = p(1 - \delta v_A) + \tfrac{1}{2}\delta v_A.$$

This leads to $v_A = \frac{1-q}{2 - \delta q}$ and $v_B = \frac{q(2-\delta)}{2 - \delta q}$. Note that $v_B > v_A$ only if $q > \frac{1}{3-\delta}$, which violates our original assumption about $q$. Thus, the only possible outcome for $q \geq \frac{1-\delta}{3-2\delta}$ is $v_A = v_B$. To support this equilibrium, proposers from $A$ must choose a mixed strategy that randomizes between forming a coalition with the remaining member of $A$ and the member of $B$. We leave computation of the equilibrium mixed strategy to an exercise.


3.3. Asymmetric Veto Powers. Another institutional variation
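in legislative institutions is that certain players, such as the president, an upper chamber, or a court, are privileged with the ability to block legislation. In this section, we provide a simple example of how to incorporate vetoes into the Baron-Ferejohn model.⁵ Now suppose that one member of our three-person legislature has an absolute veto power in that she must approve every proposal. Let party $B$ have the veto player. To keep things simple, we return to the case of equal proposal powers.

Since $B$ has an absolute veto, any proposer from $A$ must include $B$ in her coalition, while a proposer from $B$ needs the vote of only one of the two members of $A$, so that

$$z_A = 1 - \delta v_B$$
$$z_B = 1 - \delta v_A$$

A member of $A$ who is not recognized is included only when $B$ proposes and selects her, which happens with probability $\frac{1}{3} \cdot \frac{1}{2} = \frac{1}{6}$, whereas $B$ is included whenever a member of $A$ proposes. Computing the continuation values:

$$v_A = \frac{1}{3} z_A + \frac{1}{6}\delta v_A$$
$$v_B = \frac{1}{3} z_B + \frac{2}{3}\delta v_B$$

Thus, we can solve for $v_A = \frac{2(1-\delta)}{6-5\delta}$ and $v_B = \frac{2-\delta}{6-5\delta}$. Note that $v_A < v_B$ so long as $\delta > 0$.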
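
The veto example can also be solved by iterating the value equations. The following Python sketch is illustrative and rests on a stated assumption: a proposer from $B$ is taken to pick each of the two members of $A$ with probability one half, so a non-veto member who is not recognized is included with probability $\frac{1}{3} \cdot \frac{1}{2}$. The function name, starting guess, and tolerance are hypothetical choices.

```python
def veto_values(delta, tol=1e-12, max_iter=100000):
    """Iterate the two value equations until they stop changing.

    Assumed recognition probabilities: 1/3 each. A non-veto member (A) is
    included only when B proposes and picks her (probability 1/3 * 1/2);
    the veto player (B) is included whenever a member of A proposes (2/3).
    Returns (v_A, v_B).
    """
    v_a, v_b = 1.0 / 3.0, 1.0 / 3.0   # arbitrary starting guess
    for _ in range(max_iter):
        z_a = 1.0 - delta * v_b       # A proposer must buy the veto player
        z_b = 1.0 - delta * v_a       # B proposer buys one member of A
        new_a = z_a / 3.0 + delta * v_a / 6.0
        new_b = z_b / 3.0 + 2.0 * delta * v_b / 3.0
        if abs(new_a - v_a) < tol and abs(new_b - v_b) < tol:
            return new_a, new_b
        v_a, v_b = new_a, new_b
    raise RuntimeError("iteration did not converge")


if __name__ == "__main__":
    for delta in (0.5, 0.9):          # hypothetical discount factors
        v_a, v_b = veto_values(delta)
        print(delta, "v_A =", v_a, "v_B =", v_b, "total =", 2 * v_a + v_b)
```

A useful sanity check on any solution is that the three continuation values sum to one, since the whole dollar is allocated in the first period.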


3.4. The Baron-Ferejohn Model under Open Rule. In the 
preceding sections, we have focused exclusively on models where pro- 
posals cannot be amended within the current legislative session. We 
now show how the model can be extended to capture the possibility 
that proposals can be amended before a final passage vote. We now 
assume that following each proposal a member is selected at random 
from the remaining N — 1 legislators. The selected legislator has two 
choices. First, she may call the question and bring about a final pas- 
sage vote on the previous proposal. Alternatively, she may make a 
new offer or amendment. The amendment is paired against the cur- 
rent offer. The winner of this vote is the proposal on the floor at the 
beginning of the next session. In the next session, a new legislator is 
chosen to either amend or call the question. 

Now a legislative proposer has two considerations. First, just as before, a simple majority must receive their discounted continuation values in order to support the proposal on final passage. Secondly, the proposer must craft a proposal which deters others from amending. This can be accomplished by allocating sufficient resources such that the next proposer prefers to move the initial proposal rather than have her own proposal on the floor at the beginning of the next session.


5This variation of the Baron-Ferejohn game was developed in McCarty (2000a, 2000b).

To keep things simple, we will focus again on $N = 3$. First, we consider a scenario where the proposer keeps $z$, provides $\frac{1-z}{2}$ to both other legislators, and each legislator moves the question. To solve for the optimal $z$, define $v_i^2(z)$ as the continuation value of beginning a session with a proposal giving $z$ to player $i$ and $\frac{1-z}{2}$ to the other two legislators (the superscript is a label, not an exponent). Since we are focusing on symmetric equilibria, we can suppress the subscript $i$. Thus, $v^2(z)$ is the expected utility under this strategy of the first proposer and that of any proposer who successfully amends a proposal.

Given this definition, a proposer must give each legislator at least $\delta v^2(z)$ to induce her to call the question. Otherwise, she would defect to a proposal giving herself $z$ for the next period. Therefore, the equilibrium requires that $\frac{1-z}{2} \geq \delta v^2(z)$. So long as this condition holds, the proposer gets $z$ with probability 1 so that $v^2(z) = z$. Thus, the proposer will choose to maximize $z$ such that $\frac{1-z}{2} \geq \delta v^2(z)$. This leads to a solution of

$$z = \frac{1}{1+2\delta}.$$

While the proposer can secure $z = \frac{1}{1+2\delta}$ with certainty, she may prefer to secure the support of only one legislator and risk the defeat of her proposal if the other is selected to make an amendment. So now assume that the proposer keeps $z$, gives $1 - z$ to some other legislator, and 0 to the third legislator. The legislator who receives $1 - z$ moves the question if selected. The legislator who receives 0 offers an amendment giving $z$ to herself, 0 to the original proposer, and $1 - z$ to the other legislator. Such an amendment carries with the votes of the legislators receiving positive allocations in the amended proposal.

To compute the optimal $z$, we need to consider two values. Let $v_i^1(z)$ be the value to legislator $i$ of beginning the period with a proposal giving $z$ to $i$ and $1 - z$ and 0 to the others. Similarly, let $v_i^1(0)$ be the value to $i$ of the game starting from a proposal that gives her 0 and $z$ and $1 - z$ to the others. Again due to symmetry we can drop the subscripts.

First, we compute $v^1(z)$. With probability $\frac{1}{2}$, the proposal is moved and approved giving the proposer $z$. However, with probability $\frac{1}{2}$, the proposal is amended so that the original proposer gets 0 in the proposal in play at the beginning of the next session. Therefore, $v^1(z) = \frac{1}{2}z + \frac{1}{2}\delta v^1(0)$. Now consider the value of starting the period with 0. With probability $\frac{1}{2}$, the proposal is moved and passed leading to a payoff of 0. However, with probability $\frac{1}{2}$, the member is selected and can amend the proposal so that she gets $z$ in the standing proposal at the beginning of the next session. Therefore, $v^1(0) = \frac{1}{2}\delta v^1(z)$. Putting these two values together, we get $v^1(z) = \frac{1}{2}z + \frac{1}{4}\delta^2 v^1(z)$ or

$$v^1(z) = \frac{2z}{4-\delta^2}.$$

Finally, we have to ensure that the legislator receiving $1 - z$ prefers to move the question rather than amend. This requires that $1 - z \geq \delta v^1(z)$ or $z \leq 1 - \delta v^1(z)$. Therefore, the proposer will choose $z$ to maximize $v^1(z)$ subject to this constraint. The solution is $z = \frac{4-\delta^2}{4+2\delta-\delta^2}$, which leads to a continuation value of

$$v^1(z) = \frac{2}{4+2\delta-\delta^2}.$$



To determine which strategy the proposer will choose, we simply need to compare $v^1(z)$ and $v^2(z)$. Straightforward algebra shows that $v^1(z) > v^2(z)$ when $\delta > \delta^* = \sqrt{3} - 1$. Intuitively, when players are patient and value the future, it is very expensive to inhibit amendments from both legislators. Therefore, the proposer prefers to buy off only one member and take her chances with an amendment.

There are two interesting features of the open rule model. First, it is possible that coalitions are larger than minimum winning. This occurs when $\delta < \delta^*$ so that the proposer spreads resources sufficiently to deter all amendments. Secondly, there can be equilibrium delay in agreement. This occurs when $\delta > \delta^*$ and the proposer gives 0 to one member. If that member is then selected, she makes a successful amendment which precludes agreement in the first session.
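
The comparison between the two open-rule strategies for $N = 3$ can be reproduced in a few lines. The Python sketch below is an illustration only; the function names are hypothetical, and the share in `share_buy_one` is obtained by solving the binding constraint $1 - z = \delta v^1(z)$ with $v^1(z) = 2z/(4 - \delta^2)$.

```python
import math

DELTA_STAR = math.sqrt(3.0) - 1.0   # discount factor at which the strategies tie


def value_buy_both(delta):
    """v^2(z): the proposer deters both amendments, so z = 1 / (1 + 2*delta)."""
    return 1.0 / (1.0 + 2.0 * delta)


def share_buy_one(delta):
    """Share z kept when only one legislator is bought off."""
    return (4.0 - delta ** 2) / (4.0 + 2.0 * delta - delta ** 2)


def value_buy_one(delta):
    """v^1(z) = 2z / (4 - delta^2), evaluated at the optimal z."""
    return 2.0 * share_buy_one(delta) / (4.0 - delta ** 2)


def preferred_strategy(delta):
    """Which of the two strategies gives the proposer the higher value."""
    if value_buy_one(delta) > value_buy_both(delta):
        return "buy one"
    return "buy both"


if __name__ == "__main__":
    for delta in (0.5, 0.9):        # hypothetical discount factors
        print(delta, preferred_strategy(delta),
              value_buy_one(delta), value_buy_both(delta))
```

Evaluating both values at `DELTA_STAR` should give the same number up to rounding, which is one way to confirm the threshold.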

It is useful to compare the equilibrium allocations from the open rule with those for the closed rule. The literature has paid particular attention to the proposer’s share.⁶ Recall that for $N = 3$, the proposer keeps $1 - \frac{\delta}{3}$ under the closed rule. This share is always greater than $v^2(z)$ and greater than $v^1(z)$ when $\delta > \delta^*$. Thus, the open rule lowers the proposer’s advantage. However, proposal power can also be mitigated by the use of supermajority rules. Consider the case where $k = N = 3$. The proposer’s share is $\frac{3-2\delta}{3}$, which is lower than $v^1(z)$ when $\delta$ is sufficiently close to one (numerically, above roughly 0.9). Thus, for sufficiently patient players, the unanimity rule lowers proposal power below that of the open majority rule without incurring costly delay.


6This is not only an important equity consideration. In models of allocating
the benefits of costly projects, institutions which limit the proposer’s share reduce
the incentive to pass inefficient projects (Baron 1992, McCarty 2000b, Primo 2004).

4. Bargaining with Incomplete Information


In all of the bargaining models that we have seen so far, all of the agents know the disagreement or continuation values of their opponents. Consequently, they know with certainty which offers will be accepted and rejected.⁷ This assumption is obviously problematic
in many political contexts. A legislature doesn’t know whether an 
executive will sign a particular bill, states do not know whether the 
peace terms will be accepted or if the opponent will prefer to continue 
fighting, and so on. In this section, we provide a bare bones model 
of bargaining with incomplete information. We then elaborate the 
model with examples from executive-legislative bargaining and crisis 
bargaining in international relations. 


4.1. A Basic Model. Consider the setup we used for the Nash bargaining problem where two players are negotiating over the division of $X$. However, now there is uncertainty about player $A$’s disagreement value $\bar{u}_A$. With probability $\pi$, $\bar{u}_A = 0$ and with probability $1 - \pi$, $\bar{u}_A = d > 0$. We will refer to $\bar{u}_A = 0$ as the “weak” type and $\bar{u}_A = d$ as the “strong” type. To keep things as simple as possible, we assume that player $B$ makes a take-it-or-leave-it offer to $A$ of $(u_A, u_B) \in Q$. If $A$ accepts, the offer is implemented. However, if $A$ rejects, the payoffs are $(\bar{u}_A, \bar{u}_B)$.

Clearly, $A$ will only accept an offer that gives her $u_A \geq \bar{u}_A$. However, since $B$ does not know the value of $\bar{u}_A$, she does not know how much utility to transfer to $A$ to secure her agreement. If she offers less than $A$’s disagreement value, $A$ will reject, leading to $(\bar{u}_A, \bar{u}_B)$. Let $P(u_A \mid \bar{u}_A)$ be the probability that $A$ will accept $u_A$ when her disagreement value is $\bar{u}_A$. Therefore, we can write $B$’s expected utility as a function of her offer:

$$EU_B(u_A) = [\pi P(u_A \mid 0) + (1-\pi)P(u_A \mid d)]\, g(u_A) + [1 - \pi P(u_A \mid 0) - (1-\pi)P(u_A \mid d)]\, \bar{u}_B$$

Since $P(u_A \mid \bar{u}_A) = 1$ if $u_A \geq \bar{u}_A$ and 0 otherwise, we can rewrite $B$’s utility as

$$EU_B(u_A) = \begin{cases} \pi g(u_A) + (1-\pi)\bar{u}_B & \text{if } d > u_A \geq 0 \\ g(u_A) & \text{if } u_A \geq d \end{cases}$$

Given that $g(u_A)$ is decreasing in $u_A$, the only possible solutions are $u_A = d$ or $u_A = 0$. We will call $u_A = 0$ the aggressive offer and $u_A = d$ the accommodating offer. $B$ will choose the aggressive offer if and only if $\pi g(0) + (1-\pi)\bar{u}_B > g(d)$, or

$$\pi > \frac{g(d) - \bar{u}_B}{g(0) - \bar{u}_B}.$$

While very simple, the model makes a number of sensible predictions. In particular, $B$ is more likely to make the “aggressive offer” of $u_A = 0$ when

• the probability that $A$ is the weak type is high.

• her disagreement value $\bar{u}_B$ is good.

• the utility difference between the aggressive and accommodating offers, $g(0) - g(d)$, is large.


7In the open rule Baron-Ferejohn game, the uncertainty is about which player will be selected to offer an amendment, but not whether a particular player prefers an amendment to the proposal.



Given the assumptions of one-sided incomplete information and a 
single take-it-or-leave-it offer, this model is very limited. However, 
rather than generalize this abstract model, we will review a number of 
political science applications which relax these assumptions. 
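
The decision rule of the basic model is easy to express in code. The following Python sketch is a minimal illustration, not part of the original model: the function names are hypothetical, and the linear frontier $g(u_A) = 1 - u_A$ used in the example is an assumption made only so that the functions can be evaluated.

```python
def aggressive_threshold(g0, gd, ub_bar):
    """Belief threshold: B prefers the aggressive offer when pi exceeds this value.

    g0 and gd are B's payoffs g(0) and g(d) when the offers u_A = 0 and
    u_A = d are accepted; ub_bar is B's disagreement payoff (g0 > ub_bar).
    """
    return (gd - ub_bar) / (g0 - ub_bar)


def best_offer(pi, d, ub_bar, g):
    """Return (label, offer, expected utility) of B's optimal offer to A."""
    eu_aggressive = pi * g(0.0) + (1.0 - pi) * ub_bar   # only the weak type accepts
    eu_accommodating = g(d)                              # both types accept
    if eu_aggressive > eu_accommodating:
        return "aggressive", 0.0, eu_aggressive
    return "accommodating", d, eu_accommodating


def linear_frontier(total=1.0):
    """Hypothetical frontier: B keeps whatever part of `total` is not given to A."""
    return lambda u_a: total - u_a


if __name__ == "__main__":
    g = linear_frontier(1.0)
    d, ub_bar = 0.3, 0.2          # hypothetical parameter values
    print("threshold:", aggressive_threshold(g(0.0), g(d), ub_bar))
    for pi in (0.5, 0.7):
        print(pi, best_offer(pi, d, ub_bar, g))
```

Raising `ub_bar` or widening the gap between `g(0.0)` and `g(d)` lowers the threshold, which matches the comparative statics listed in the text.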


5. Application: Veto Bargaining 


In Chapter 7, we studied the application of the Romer-Rosenthal agenda-setting model to the presidential veto. While this complete information model of the presidential veto provides an excellent tool for studying veto power, it cannot provide a basis for studying vetoes, for the obvious reason that it predicts vetoes will not occur. We now turn to a simple model for studying vetoes, rather than veto power. In this model, vetoes do occur. This simple incomplete information model in turn provides the foundation for building more complex models of veto bargaining that incorporate reputation, learning, and dynamics.⁸

If one wants to explain the fact that vetoes occur, one must dispense with at least one of the assumptions underlying the basic model. While the model presented in Chapter 7 has a number of very restrictive assumptions, few of them are actually consequential in the prediction of no vetoes. One important exception is the assumption that $C$ has complete information about the preferences of $P$ and $O$. When there is uncertainty about these preferences, vetoes may occur because the legislature overestimates its ability to extract concessions from the president or the override pivot.

Relaxing the assumption of complete information has been the starting point for most of the recent work on veto bargaining (Matthews 1989, McCarty 1997, and Cameron 2000). To present the basic flavor of these models, we consider a model without an override possibility so that $q$ remains the policy in the event of a veto. To capture the uncertainty that the proposer $C$ faces about the receiver $P$’s preferences, we assume she believes $P$ is one of two preference “types,” a moderate with ideal point $m$ or an extremist with ideal point $e$. We assume all agents have linear preferences given by $-|x - i|$ for policy $x \in \mathbb{R}$ and ideal point $i \in \{c, e, m\}$ where $e < m < c$. Let $\pi$ be the probability that $P$ is the extreme type.


8This section draws heavily on Cameron and McCarty (2004).

The main implication of the uncertainty about preferences is that $C$ no longer knows for sure which bills the president will accept and which he will veto. To see this, consider Figure 10.2 where we assume that $q < e$. Here the set of bills the extremist type of receiver is willing to accept over the status quo is only a subset of those the moderate type is willing to accept. Thus, $C$ can force a more attractive bill (from her perspective) on the moderate receiver than she can on the extremist one. $C$’s dilemma is whether to propose a bill she finds relatively less attractive but that both types will accept (a bill like $b_e$) or be more aggressive and propose a bill (like $b_m$) she finds more attractive but only the moderate receiver will accept. Clearly, the attractiveness of the gamble depends on $C$’s beliefs about $P$’s type. If $\pi$ is high (so $C$ believes $P$ is probably an extremist), $C$ will likely be deterred from making the aggressive proposal. On the other hand, if $\pi$ is low (so $C$ believes $P$ is probably a moderate), $C$ may well find the aggressive proposal an attractive gamble. If she offers it, on occasion it will prove a poor choice: the receiver will turn out to be the extreme type and will veto it.


Insert Figure 10.2 Here 


Now we will compute the necessary conditions for an equilibrium veto to occur. First, assume the preference configuration of Figure 10.2 holds, i.e., $q < e < m < c$. Let $B_t(q)$ be the set of bills that each type $t \in \{e, m\}$ is willing to accept over the status quo. Similar to our analysis of the complete information version of the model, these sets are $[q, 2t - q]$ if $t > q$ and $[2t - q, q]$ otherwise. Notice that for any $q$, president $m$ is willing to accept a higher bill than is $e$. Since $e > q$, we have $B_e(q) = [q, 2e - q] \subset B_m(q) = [q, 2m - q]$, so any bill that $e$ accepts $m$ will accept, but the converse is not true. Therefore, $C$ faces a tradeoff. She can propose $2e - q$ which both types accept, or she can propose $2m - q$ which $e$ will veto. Given $C$’s beliefs, the latter strategy results in a veto with probability $\pi$.

Case 1: $c > 2m - q$. Given $C$’s linear preferences, her utility from $b = 2e - q$ is $2e - q - c$ while her expected utility from $b = 2m - q$ is $\pi q + (1-\pi)(2m - q) - c$. Thus, if $\pi < \frac{m-e}{m-q}$, she prefers $b = 2m - q$ and a veto occurs with probability $\pi$.

Case 2: $2e - q < c < 2m - q$. $C$’s payoff from $b = 2e - q$ remains $2e - q - c$, but now $m$ will accept $b = c$. Thus, proposing her ideal point leads to an expected utility of $\pi(q - c)$. Thus, $C$ will propose $b = c$ if $\pi < \frac{c + q - 2e}{c - q}$. Note that the critical value of $\pi$ is lower than in case 1, making a veto less likely for this preference configuration with $c$ closer to $m$.

Case 3: $c \leq 2e - q$. Now both types will accept $b = c$. So $C$ proposes her ideal point for all values of $\pi$ and no vetoes occur.
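
The three cases can be collected into a single routine. The Python sketch below is a hedged illustration under the assumptions of this section (linear preferences, $q < e < m < c$, no override); the function names and the numerical examples are hypothetical. It evaluates the proposer's safe and aggressive bills directly and, separately, reports the critical belief from the case analysis.

```python
def accepts(ideal, bill, q, tol=1e-12):
    """A receiver with linear preferences signs any bill at least as close as q."""
    return abs(bill - ideal) <= abs(q - ideal) + tol


def proposer_choice(q, e, m, c, pi):
    """Compare C's safe and aggressive bills; assumes q < e < m < c.

    pi is the probability that the receiver is the extremist e.
    Returns (bill, veto_probability, expected_utility).
    """
    safe = min(c, 2 * e - q)        # best bill that both types accept
    aggressive = min(c, 2 * m - q)  # best bill that the moderate accepts

    def evaluate(bill):
        veto_prob = 0.0
        utility = 0.0
        for ideal, prob in ((e, pi), (m, 1.0 - pi)):
            signed = accepts(ideal, bill, q)
            outcome = bill if signed else q   # a veto leaves the status quo
            veto_prob += 0.0 if signed else prob
            utility += prob * -abs(outcome - c)
        return bill, veto_prob, utility

    options = [evaluate(safe), evaluate(aggressive)]
    # max() keeps the first maximal element, so ties go to the safe bill
    return max(options, key=lambda option: option[2])


def critical_pi(q, e, m, c):
    """Largest pi below which C gambles on the aggressive bill (None in Case 3)."""
    if c <= 2 * e - q:                # Case 3: both types accept c
        return None
    if c > 2 * m - q:                 # Case 1
        return (m - e) / (m - q)
    return (c + q - 2 * e) / (c - q)  # Case 2


if __name__ == "__main__":
    # hypothetical configuration: q = 0, e = 1, m = 2
    for c in (5, 3, 1.5):
        print("c =", c, "critical pi =", critical_pi(0, 1, 2, c),
              "choice at pi = 0.3:", proposer_choice(0, 1, 2, c, 0.3))
```

The direct comparison in `proposer_choice` and the thresholds in `critical_pi` are computed independently, so agreement between them is a check on the case analysis.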

The punchline of this simple model is that vetoes are much less likely to occur when $C$’s preferences are closer to $m$ and $e$. Empirically, Cameron (1999) has found that vetoes are less likely to occur during periods of unified party control of Congress and the presidency, which he interprets as evidence for this prediction.


5.1. Models with Reputation, Learning, and Dynamics. An interesting feature of the incomplete information model is that a moderate receiver $P$ does better if the proposer $C$ believes $P$ is the extreme type. This raises the possibility that $P$ might attempt to manipulate $C$’s beliefs about his type, that is, his reputation. In this section, we examine three models in which the actors try to manipulate $P$’s reputation. All are signaling models, because an informed player takes an action that conveys information about $P$’s type. In the first two models, the veto threat and sequential veto bargaining (SVB) models, the informed player is the receiver $P$ himself. In the third model, the blame game veto model, both $C$ and $P$ take actions to convey information to uninformed voters.

5.1.1. Veto Threats. Ranging from the dramatic “read my lips” variety to the much more mundane “statements of administration policy” routinely produced by the Office of Management and Budget, the veto threat is an important feature of legislative politics in the U.S. However, none of the models reviewed thus far provides any leverage on understanding this phenomenon. Matthews (1989), however, provides an influential model of veto threats where the president may use a costless signal or “cheap talk” to reveal information about preferences and veto intentions.

To illustrate this model, it is helpful to increase the number of presidential types from two to four. Therefore, in addition to $m$ and $e$, we add the two following types: $r$, the “recalcitrant” type, and $a$, the “accommodating” type. We assume that each type and $C$ have linear preferences and that $r < q < e < m < c < a$ as in Figure 10.3. President $r$ is called recalcitrant because he will veto any bill that $C$ prefers to the status quo while $a$ is accommodating because he prefers $c$ to the status quo. We will also assume that the probabilities of these types are $\pi_r$, $\pi_e$, $\pi_m$, and $\pi_a$. In this game, the president first makes a “speech” which is simply a costless signal to the legislature. Each of these messages has no literal meaning, just a contextual one derived from the equilibrium that is being played. Following the speech, $C$ updates her beliefs about the president’s preferences and then makes a proposal which the president can either accept or reject.


Insert Figure 10.3 Here


As a baseline, first consider an equilibrium where the president’s speeches contain no information because each type makes the same speech. In this babbling equilibrium, $C$ will simply choose the bill from $b_r = q$, $b_e = 2e - q$, $b_m = 2m - q$, or $b_a = c$ to maximize her utility. For any such choice, those with lower types will veto. For example, if $C$ chooses $b_m$, types $e$ and $r$ will veto so that the veto probability will be $\pi_e + \pi_r$. Rather than present the formulae for the conditions for each proposal, we illustrate them graphically in Figure 10.4. The first panel shows which proposal will be made in the babbling equilibrium for different values of $\pi_m$ and $\pi_e$ for given values of $\pi_r$ and $\pi_a$. Note that the proposal $b_r = q$ is never made since $C$ does at least as well with a vetoed proposal. Note that this equilibrium is somewhat bad from the president’s perspective. If the president is type $a$, there is a utility loss associated with the fact that $C$ may propose the less desirable policies $b_m$ and $b_e$. For president $m$, there are losses associated with the fact that $C$ might propose $c$ (which he then vetoes) rather than his preferred $b_e$. Since $r$ and $e$ only get their status quo utility from all proposals, they are not affected. $C$ is also affected by the lack of information as it may force her either to accommodate more than necessary or to risk a veto.
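
The proposer's problem in the babbling equilibrium can be sketched computationally. The Python code below is an illustration only: the function names, the ideal points, and the type probabilities in the example are assumptions and are not taken from the text or from Figure 10.4. It evaluates each candidate bill against every presidential type and returns the bill with the highest expected utility for $C$.

```python
def accepts(ideal, bill, q, tol=1e-12):
    """A president with linear preferences signs any bill at least as close as q."""
    return abs(bill - ideal) <= abs(q - ideal) + tol


def babbling_proposal(q, c, types):
    """C's optimal bill when speeches are uninformative.

    `types` maps a label to (ideal_point, probability). The candidate bills
    are q, c, and each type's indifference point 2*t - q capped at c.
    Returns (bill, veto_probability, expected_utility); ties go to the
    lowest bill.
    """
    candidates = {q, c}
    for ideal, _ in types.values():
        if ideal > q:
            candidates.add(min(c, 2 * ideal - q))

    best = None
    for bill in sorted(candidates):
        veto_prob, utility = 0.0, 0.0
        for ideal, prob in types.values():
            signed = accepts(ideal, bill, q)
            veto_prob += 0.0 if signed else prob
            # a veto leaves the status quo q in place
            utility += prob * -abs((bill if signed else q) - c)
        if best is None or utility > best[2]:
            best = (bill, veto_prob, utility)
    return best


if __name__ == "__main__":
    # hypothetical configuration with r < q < e < m < c < a
    types = {"r": (-1, 0.1), "e": (1, 0.2), "m": (2, 0.3), "a": (6, 0.4)}
    print(babbling_proposal(q=0, c=5, types=types))
```

In this hypothetical configuration the chosen bill is $b_m = 2m - q$, which types $r$ and $e$ veto, so the veto probability equals the sum of their probabilities.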


Insert Figure 10.4 Here 


Given the bad outcomes from the babbling equilibrium, it is reasonable to ask whether there are other equilibria where more information is transmitted. Matthews shows that some information can be revealed in presidential speeches, but not all of it. First, consider why a separating profile in which every presidential type gives a distinct speech cannot be an equilibrium. If $C$ could learn the president’s type from the speech, she would optimally propose $b_r$ to $r$, $b_e$ to $e$, etc. However, since $m$ prefers $b_e$ to $b_m$, $m$ would prefer to defect and give $e$’s speech. Thus, a separating equilibrium cannot exist. Matthews shows that the most informative equilibrium is one where type $a$ reveals his type with an “accommodating” speech and the other types all make the same “threatening” speech. Following an accommodating speech, $C$ correctly infers that the president will accept her ideal point and thus proposes $c$. Type $a$ is willing to make the accommodating speech since he clearly prefers $c$ to $b_m$ or $b_e$. Following the threatening speech, $C$ learns that the president is not $a$ and updates her beliefs accordingly. Given these beliefs, $C$ chooses between $b_m$ and $b_e$. The second panel of Figure 10.4 illustrates the optimal proposal as a function of $\pi_m$ and $\pi_e$ for given values of $\pi_r$ and $\pi_a$. There are two important things to note. First, it is more likely that $C$ proposes $b_e$ because the knowledge that the president is not type $a$ makes the probability that $b_m$ will be vetoed much higher. Second, this leads to the prediction that $C$ makes a larger concession to the president’s preferences after a threatening speech than after an accommodating speech.

It is important to note that an informative equilibrium is not
guaranteed to exist. Suppose type a preferred $b_m$ to c to $b_e$. Then an
informative equilibrium would exist only if C's best response to the
threatening message were $b_e$. Otherwise, a would defect to the
threatening speech. Similarly, if a prefers $b_e$ to c, no informative
equilibrium can exist.

For some configurations of preferences, the veto threat may simply
be a bluff. Consider what would happen if m were moved in Figure 10.3
sufficiently far to the right that he preferred c to q (and thus became
an accommodator) but still preferred $b_e$ to c. In the informative
equilibrium, m would still give the threatening speech, but it is a bluff
in the sense that he would have signed C's ideal point.

The informative equilibrium makes C better off (if it didn't, she
could just turn off the TV and ignore the speech). However, it is
possible that some presidential types will be worse off. Suppose that
a were repositioned so that he preferred $b_m$ to c to $b_e$. Further,
suppose that the babbling equilibrium produced $b_m$, while a threat in
the more informative equilibrium produced $b_e$. Then a would clearly
prefer the outcome of the babbling equilibrium to the c he gets from
making his accommodating speech in the informative equilibrium.

5.1.2. Sequential Veto Bargaining with Incomplete Information.
Often, the proposer can make multiple offers, learning about the receiver
as she does so. For example, if the receiver rejects a tough offer early,
the proposer may believe the receiver is genuinely tough. If so, the
proposer's next offer is apt to be more accommodating. This "haggling"
dynamic is very common in many types of bargaining, and one might well
expect to see it in veto bargaining as well. But a complicating factor
is misdirection: the receiver will often have an incentive to reject
early offers in order to build a reputation that leads to better later
offers. But knowing this, why should the proposer actually make the
compromises? The sequential veto bargaining (SVB) model explores these
questions about learning and credibility.

A simple example conveys many of the basic ideas. First, consider
a situation in which q = 0, e = 1/4, m = .6, and c = 1. By now it
should be clear that in a one-shot game (without a veto threat)
$b_e = .5$ and $b_m = c$. Using the results of the one-shot incomplete
information model, it is easy to see that C will offer $b_m = c$ if
$\pi < .5$ and $b_e = .5$ otherwise. But suppose this is not a one-shot
game, so that C may make a second offer if the first is rejected. More
specifically, suppose bargaining breaks down with probability p, but
otherwise a second offer can be made. (The probability of a bargaining
breakdown reflects the inherent uncertainty of the legislative and other
political processes.) Is a haggling equilibrium possible, that is, one in
which C first makes a tough offer then, following a veto and no
bargaining breakdown, makes a more accommodating offer?
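
The one-shot benchmark in this example can be checked numerically. The following Python sketch assumes linear (distance) utilities and an ordering q <= e < m <= c, as in the example; the function names are illustrative and do not come from the text. It also assumes that type e vetoes the tough offer, which holds here because $b_m$ lies to the right of $2e - q$.

```python
def acceptable_bill(ideal, q, c):
    """Bill closest to c that a president with this ideal point weakly prefers to q.

    Assumes q <= ideal and q <= c with linear (distance) utility, so the
    president signs any bill in the interval [q, 2 * ideal - q].
    """
    return min(c, 2 * ideal - q)


def one_shot_offer(pi, q, e, m, c):
    """Congress's optimal single offer when the president is type e with probability pi."""
    b_e = acceptable_bill(e, q, c)  # signed by both types
    b_m = acceptable_bill(m, q, c)  # signed by type m only
    # Tough offer: type m signs b_m, type e vetoes and q stays in place.
    eu_tough = -(1 - pi) * abs(c - b_m) - pi * abs(c - q)
    # Accommodating offer: both types sign b_e.
    eu_accommodating = -abs(c - b_e)
    return b_m if eu_tough > eu_accommodating else b_e


if __name__ == "__main__":
    q, e, m, c = 0.0, 0.25, 0.6, 1.0
    for pi in (0.3, 0.7):
        print(pi, one_shot_offer(pi, q, e, m, c))
```

With the example's ideal points, the rule switches from the tough offer c to the accommodating offer .5 once the probability of the extreme type reaches one half.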

In such a haggling equilibrium, the moderate president must accept
the tough offer in the first round (if both types rejected the tough
offer, then C should make the accommodating offer lest a breakdown
saddle her with the unappealing status quo). Therefore, the following
"incentive compatibility constraint" must hold:


$$m - c \geq (1-p)(2e - q - m) + p(q - m)$$


or

$$p \geq \frac{c - q - 2(m - e)}{2(e - q)}$$

The left side of the constraint is the moderate's payoff from signing c.
On the right, $2e - q - m$ is his payoff from the accommodating offer
$b_e = 2e - q$ and $q - m$ is his payoff from the status quo. The
incentive compatibility constraint indicates that accepting the tough
offer in the first round is better for the moderate type than rejecting
the offer and holding out for the more proximate accommodating offer,
taking into account the probability of a bargaining breakdown. In the
example, the critical value for the breakdown probability is .6. Let
$\mu(e)$ be C's belief that the president is the extreme type, following
a veto. Note the following: in a haggling equilibrium, it must be the
case that $\mu(e) \geq .5$; otherwise, following a veto, C will make the
tough offer again in the final period (this was shown above). But if the
probability of a breakdown is greater than .6, then the moderate type
accepts the initial offer, so that by Bayes' Rule $\mu(e) = 1$ following
a veto, and C will indeed make the accommodating offer in the second
round.

There remains an additional incentive compatibility constraint to
examine, however. Congress must find it more appealing to make a
tough offer followed by an accommodating offer (conditional on a veto
and no breakdown), rather than make an initial accommodating offer
that would surely be accepted. This requires that


$$\pi[(1-p)(2e - q - c) + p(q - c)] \geq 2e - q - c$$


or

$$\pi \leq \pi_p \equiv \frac{c - 2e + q}{c - 2e(1-p) + q - 2pq}$$


In the example, this condition becomes $\pi \leq \frac{1}{1+p}$. We can
now indicate a haggling equilibrium in the two-period, two-type
sequential veto bargaining model with the ideal points indicated earlier.
If $.6 < p < 1$ and $\pi \leq \pi_p$, then C offers $b_1 = b_m = c$ and
$b_2 = b_e = .5$. Presidential type m accepts both $b_e$ and $b_m$ in
both periods, while type e accepts offer $b_e$ and vetoes $b_m$ in both
periods. Finally, C's belief that the president is an extreme type is
$\mu(e) = 1$ following a veto.
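
The two incentive compatibility conditions can be evaluated directly. The Python sketch below is only an illustration under the example's assumptions (linear utilities, q < e < m <= c, and a tough offer equal to c); the function names are hypothetical. It computes the minimum breakdown probability for the moderate type and the maximum prior for Congress.

```python
def breakdown_threshold(q, e, m, c):
    """Smallest breakdown probability p at which type m signs the tough offer c.

    Solves m - c >= (1 - p) * (2e - q - m) + p * (q - m) for p, assuming
    q < e < m <= c and that the tough offer is c itself (2m - q >= c).
    """
    return (c - q - 2 * (m - e)) / (2 * (e - q))


def prior_threshold(p, q, e, c):
    """Largest prior pi (probability of type e) at which C prefers to haggle."""
    b_e = 2 * e - q  # accommodating offer, signed by both types
    # C's expected loss after a first-round veto by type e.
    loss_after_veto = (1 - p) * (c - b_e) + p * (c - q)
    return (c - b_e) / loss_after_veto


def haggling_equilibrium_exists(p, pi, q, e, m, c):
    """Check both incentive compatibility conditions at once."""
    return p >= breakdown_threshold(q, e, m, c) and pi <= prior_threshold(p, q, e, c)


if __name__ == "__main__":
    q, e, m, c = 0.0, 0.25, 0.6, 1.0
    print(breakdown_threshold(q, e, m, c))   # close to 0.6
    print(prior_threshold(0.8, q, e, c))     # close to 1 / 1.8
```

For the example's ideal points the second function reduces to 1/(1 + p), so a higher risk of breakdown makes Congress less willing to open with the tough offer.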

5.1.3. Bargaining over Multiple Bills. While the last section shows
that incomplete information can affect the dynamics of bargaining on a
single issue, McCarty (1997) considers how informational and
reputational incentives alter the bargaining across multiple issues over
time. He considers a model of veto bargaining with incomplete information
where P and C bargain over a series of policies with status quo points
$q_1$ and $q_2$. In each of the two periods, C proposes $b_t$ and the
president decides whether to accept or reject it. Thus, bargaining over
each policy is modeled as one-shot such that if P vetoes $b_t$, the
status quo $q_t$ is the policy outcome. Since the president's ideal point
is assumed to be constant across policies, the outcome on policy 1 may
provide information to C prior to her making an offer on policy 2. Since
in the last period the game is identical to the one-shot incomplete
information game described above, type m does better on the second
policy by having C believe that he is the extreme type if preferences
are such as those given in panel a or b. Thus, given those preference
configurations, m may be willing to use his first period veto to build a
reputation as the extreme type in order to get a better outcome on policy
2. This involves rejecting bills that he, but not type e, prefers to q.
Thus, reputational incentives increase the likelihood of a veto on policy
1. Given that C understands these incentives, she may be willing to
be sufficiently accommodating on the first policy to discourage type
m from vetoing on reputational grounds. Thus, McCarty's model predicts a
"honeymoon" pattern of accommodating policies early in the president's
term followed by less accommodating policies toward the end when
reputational incentives are diminished. However, he notes that since the
existence of reputational incentives depends on preference
configurations such as those in panels a and b, this honeymoon effect is
unlikely when the expected difference between P and C is small, such as
during unified governments.


5.2. Blame Game Vetoes. A recent model argues that vetoes
are less a product of legislative uncertainty than of electoral politics.
Groseclose and McCarty (2000) examine a model in which the legislative
agenda setter uses its proposal power to signal that the president
has policy views that are out of step with the voters. In this "blame
game" model, vetoes occur when the agenda setter receives a larger
payoff from signaling that the president has extreme preferences than
she does from enacting new policy. Thus, in this model, the electorate's
uncertainty about the president is critical, not the uncertainty
of legislators.

To illustrate a simple version of this model, consider a new actor V,
the voter. We assume V also has linear preferences and an ideal point
v. Following the notation of the last section, V believes the president
is type e with probability $\pi$ and type m otherwise. We focus on the
case where $e < m \leq v$. We assume the voter evaluates the president
based on the expected distance between the president's ideal point and
her own ideal point. Therefore, the voter's evaluation is just


$$w(e,m,\pi;v) = -\pi|v - e| - (1-\pi)|v - m| = \pi e + (1-\pi)m - v$$
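
As a quick check on the voter's evaluation, the following Python sketch computes the expected-distance form and compares it with the simplified linear form. The numerical values of e, m, and v are hypothetical and chosen only to satisfy e < m <= v.

```python
def voter_evaluation(pi, e, m, v):
    """Voter's evaluation of the president: minus the expected distance to v."""
    return -pi * abs(v - e) - (1 - pi) * abs(v - m)


def voter_evaluation_linear(pi, e, m, v):
    """Simplified form, valid only when e < m <= v."""
    return pi * e + (1 - pi) * m - v


if __name__ == "__main__":
    e, m, v = 0.0, 0.5, 1.0  # illustrative ideal points
    for pi in (0.2, 0.8):
        print(pi, voter_evaluation(pi, e, m, v), voter_evaluation_linear(pi, e, m, v))
```

Because e < m, the evaluation falls as the probability of the extreme type rises, which is what gives Congress a reason to raise that probability and the president a reason to lower it.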


An important feature of this model is that P and C care how much
expected utility V gets from the president's position. The most
interesting case is one of conflict, in which the president gets greater
utility when the voter believes he is a moderate and Congress gets
greater utility when the voter believes the president is an extremist.
Such a case would plausibly arise when Congress and the presidency are
controlled by different political parties or factions, especially when
those parties are highly polarized ideologically, and voters are
generally more moderate. In such a case, C and P trade gains from
enacting policy against gains from political posturing. More
specifically, the president would like to take actions that lead the
public to lower $\pi$ while the legislature would like to take actions
that lead the public to increase $\pi$. We allow C and P to value these
trade-offs differently by letting $\lambda_c$ and $\lambda_p$ be the
respective weights each places on policy. Since w is decreasing in
$\pi$, Congress's utility must fall with w and the president's must rise
with it. Therefore, the utility functions for C and P become:


$$-\lambda_c|x - c| - (1-\lambda_c)(\pi e + (1-\pi)m - v)$$


and

$$-\lambda_p|x - p| + (1-\lambda_p)(\pi e + (1-\pi)m - v)$$
An important assumption of this model is that while V is relatively
uninformed about P's preferences, C is fully informed. Therefore, C
may be able to credibly communicate its information about $\pi$ through
its choice of bill. Similarly, the president's decision whether to veto
particular proposals may also provide information to voters about his
preferences.

A particularly interesting equilibrium is one in which C proposes
an acceptable bill when P is moderate and submits a bill that will be
rejected when the president is extreme. McCarty (2002) shows that
such an equilibrium is the only one in which vetoes occur and it exists
if and only if the following two conditions hold:


$$\frac{\lambda_p - \lambda_c}{\lambda_p \lambda_c}(1-\pi)(m - e) \geq 2(e - q) \tag{10.1}$$
and 
Ap — 
10.2 2>4—_— 
(10.2) — Apre T 


These conditions produce a number of predictions about the occurrence
of vetoes.⁹ First, note that the first condition cannot be satisfied if
m = e or $\pi = 1$. Thus, voter uncertainty about the president's
preferences is crucial. Without this uncertainty, orchestrating a veto
has no signaling value to C, so she might as well make acceptable
proposals to both types. Next, note that both conditions are easier to
satisfy when $\pi$ is lower. Since the ex ante evaluation of the
president is decreasing in $\pi$ (the probability he is extreme), the
model suggests that vetoes are more likely to occur when the public
believes the president is moderate (that is, believes the president is
ideologically proximate). Intuitively, Congress finds the blame game most
attractive when it has negative information about the president's policy
preferences that is inconsistent with the voter's beliefs.

The next three predictions are based on C's and P's willingness to
trade policy gains for political gains. Figure 10.5 illustrates how each
of the conditions is affected by the policy weights $\lambda_p$ and
$\lambda_c$. The area under the higher solid line represents the
combinations of $\lambda_p$ and $\lambda_c$ that satisfy the first
condition. Alternatively, the area above the lower dashed line
represents the combinations satisfying the second condition. The blame
game equilibrium described above exists in the intersection of these
regions. First, note that the first condition can be met only when
$\lambda_p > \lambda_c$,


°These are conditions are necessary for the case of c > 2m — q+ ie (m — 
Pp 


e). Different positions of c result in slightly modified but qualitatively similar 
conditions. 
suggesting that the president must put relatively more weight on the
policy outcome than does Congress. If this were not the case, C would
prefer to achieve policy gains by passing mutually attractive bills rather
than seek purely electoral advantage by passing bills the president will
reject. However, the second condition puts an upper bound on the
difference in policy weights. If $\lambda_p$ is much greater than
$\lambda_c$, C loses the ability to signal credibly through its
proposals. One final prediction emerges from the fact that only extreme
types ever veto in the blame game model. Since only type e vetoes, every
veto is followed by a reduction of voter support.


Insert Figure 10.5 Here 


6. Application: Crisis Bargaining 


One of the limitations of bargaining theory is that solutions are
generally highly dependent on the bargaining protocol and are therefore
not robust to changes in the extensive form of the game. In the context 
of veto bargaining, this is not such a large problem because its protocol 
is often codified in constitutional provisions and well-established leg- 
islative procedures. However, in the case of crisis bargaining among 
sovereign states, it is clearly less desirable to have bargaining solutions 
depend heavily on particular extensive forms since the relevant proto- 
cols are generally more informal, non-codified, and unobservable due 
to secrecy concerns. 

Recognizing this problem, Banks (1990) considers what the equilibria
of a large class of crisis bargaining games must have in common.
Consider the following crisis bargaining scenario. Two states, 1 and
2, bargain over 1 unit of territory. Let x be country 1's share.
Following Banks, we assume that both countries are risk neutral
so that we can define country 1's payoff from a settlement as x and
country 2's as $1 - x$. Failure to reach an agreement on the division of
the territory leads to a war.¹⁰ We assume that country 1's expected
utility of a war is u and country 2's is v. These expected utilities
encapsulate expectations about the probability of winning the war, the
benefits of winning and losing, and the allocation of territory that the
winner can secure.

Banks assumes that country 1 has an informational advantage vis-à-vis
country 2 about the values of (u, v). Following the usual practice,
he models this asymmetric information by assuming that country 1's
type is some $t \in T$ defined such that country 1's expected benefits of
war, u(t), are increasing in its type. While country 1 learns t prior to
negotiations, country 2 has only a common knowledge prior f(t) over
T.


¹⁰We include in the set of possible agreements the outcome in which the
ex ante status quo remains intact.

Given this framework, standard game theoretic models would specify
a set of decisions available to the countries and a (probabilistic)
outcome function specifying the probability of a war and the
distribution of settlements as a function of these decisions. From this
model, one derives equilibrium strategies $(\sigma_1(t), \sigma_2)$, from
which the equilibrium probability of war p(t) and expected settlement
x(t) can be computed. In such an equilibrium, country 1's payoffs are

$$U(t;x,p) = p(t)u(t) + (1 - p(t))x(t)$$
Clearly, not every p(t) and x(t) can arise from a Bayesian equilibrium.
In particular, Bayesian equilibrium imposes two requirements. The first
is incentive compatibility. Type t cannot prefer the outcomes p(t') and
x(t') to p(t) and x(t). Otherwise it would defect from its equilibrium
strategy $\sigma_1(t)$. Thus, incentive compatibility requires


$$p(t)u(t) + (1 - p(t))x(t) \geq p(t')u(t) + (1 - p(t'))x(t') \tag{10.3}$$
$$p(t')u(t') + (1 - p(t'))x(t') \geq p(t)u(t') + (1 - p(t))x(t) \tag{10.4}$$

The second condition imposed by Bayesian equilibrium is individual
rationality.¹¹ It cannot be the case that u(t) is greater than x(t) unless
p(t) = 1. Otherwise, t would withdraw from the agreement and start
a war with probability 1. The individual rationality constraint is


$$p(t)u(t) + (1 - p(t))x(t) \geq u(t) \tag{10.5}$$

While incentive compatibility and individual rationality are minimal
requirements, we will see that they impose quite a bit of structure on
bargaining outcomes. The most important features of the Bayesian
equilibrium concern the monotonicity of p, x, and U in t.
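
For a finite set of types, the incentive compatibility and individual rationality constraints can be checked mechanically. The Python sketch below is an illustration only: the two-type example, its numbers, and the function names are hypothetical, and outcomes are passed as dictionaries keyed by type.

```python
from itertools import permutations


def payoff(p_s, x_s, u_t):
    """Expected payoff to a type with war utility u_t from the outcome (p_s, x_s)."""
    return p_s * u_t + (1 - p_s) * x_s


def incentive_compatible(p, x, u, tol=1e-9):
    """True if no type t strictly prefers the outcome intended for another type s."""
    return all(
        payoff(p[t], x[t], u[t]) >= payoff(p[s], x[s], u[t]) - tol
        for t, s in permutations(list(u), 2)
    )


def individually_rational(p, x, u, tol=1e-9):
    """True if every type does at least as well as going to war for sure."""
    return all(payoff(p[t], x[t], u[t]) >= u[t] - tol for t in u)


if __name__ == "__main__":
    u = {"low": 0.3, "high": 0.6}   # war utilities, increasing in type
    p = {"low": 0.0, "high": 0.5}   # probability of war by type
    x = {"low": 0.5, "high": 0.7}   # settlement by type
    print(incentive_compatible(p, x, u), individually_rational(p, x, u))
```

In this example the higher type faces a higher probability of war and receives a larger settlement, which is the pattern that the lemmas below show must hold in general. Swapping the two outcomes across types breaks incentive compatibility.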


LEMMA 10.1. If p and x are incentive compatible and individually 
rational, then p(t) is weakly increasing on T. 


PROOF. Let t' > t. Note that we can subtract the right side of
equation 10.3 from the left side of equation 10.4 and the left side of
equation 10.3 from the right side of 10.4 to produce


$$p(t')[u(t') - u(t)] \geq p(t)[u(t') - u(t)]$$

Since u(t) is strictly increasing, $u(t') - u(t) > 0$ so that it must be
the case that $p(t') \geq p(t)$. □


¹¹The similarity of Banks's approach to mechanism design should be
obvious to the attentive reader.

This lemma shows that in any Bayesian equilibrium the probability
of war cannot decrease as country 1's expected utility of war increases.
Not only is this a feature of strategic models, it is consistent with the
assumptions of a number of decision-theoretic models of war.

For the next result, we need to define the set of types who resolve
the dispute through bargaining with a positive probability for some
outcome p, x. Let $T_b(x,p) = \{t \in T : p(t) < 1\}$. Note that
individual rationality requires that $x(t) \geq u(t)$ for any
$t \in T_b$.


LEMMA 10.2. If p and x are incentive compatible and individually
rational, then x(t) is weakly increasing on $T_b$.


PROOF. Let $t, t' \in T_b$ and t' > t so that Lemma 10.1 implies
$1 > p(t') \geq p(t)$. Since $x(t) \geq u(t)$ for all $t \in T_b$, we
know that


$$p(t)u(t') + (1 - p(t))x(t') \geq p(t')u(t') + (1 - p(t'))x(t') \tag{10.6}$$


We can combine this result with equation 10.4 to produce


$$p(t)u(t') + (1 - p(t))x(t') \geq p(t)u(t') + (1 - p(t))x(t)$$


which reduces to $x(t') \geq x(t)$ after dividing by $(1 - p(t))$, which
we know is positive since $t \in T_b$. □


Not surprisingly, country 1 must do at least as well in the bargaining
outcome when its war utility improves. Thus, taken together, Lemmas
10.1 and 10.2 suggest that in any Bayesian equilibrium higher types
may get better bargaining outcomes, but that this comes at a greater
risk of war.¹² However, incentive compatibility requires that this
trade-off benefit higher types (or else they would mimic lower types).
Let $T_w = \{t \in T : p(t) > 0\}$ so that $T_w$ is the set of types that
go to war with some probability.


LEMMA 10.3. If p and x are incentive compatible and individually
rational, then U(t; x, p) is weakly increasing on T and strictly
increasing on $T_w$.


PROOF. Let t' > t, and $t', t \in T_w$. Suppose (contra the lemma)
that $U(t) \geq U(t')$ so that


$$p(t)u(t) + (1 - p(t))x(t) \geq p(t')u(t') + (1 - p(t'))x(t')$$

Since u(t') > u(t), this implies that

$$p(t)u(t') + (1 - p(t))x(t) > p(t')u(t') + (1 - p(t'))x(t') \tag{10.7}$$

Since $t', t \in T_w$, p(t) and p(t') are greater than zero so that
equation 10.7 violates equation 10.4. Thus, U(t') > U(t). However, this
strict inequality does not hold on $T \setminus T_w$. If
$t, t' \in T \setminus T_w$ then p(t) = p(t') = 0 so that equations 10.3
and 10.4 clearly imply that x(t) = x(t') and U(t) = U(t'). □


¹²Banks also shows that if x and p are incentive compatible and
individually rational then x(t') > x(t) if and only if p(t') > p(t) for
t' > t and $t, t' \in T_b$.


Thus, having a better expected utility of war cannot make country 1
worse off. In fact, for types that go to war with a non-zero probability,
higher war payoffs lead to strictly higher equilibrium payoffs.¹³

While the incentive compatibility approach can go a long way
towards telling us what predictions are generic to crisis bargaining
models, there are a number of questions that it cannot resolve. For
example, we do not learn how country 2's perceptions of country 1's war
utilities, as measured by the priors f(t), affect the likelihood of war
or the bargaining settlement. For that we will have to turn to more
explicit models of crisis bargaining.


6.1. Models of Crisis Bargaining. Fearon's (1995) seminal
article explores several models in the class covered by Banks's results.
He gives a specific form for (u, v). In particular, Fearon assumes that
each country has a cost of war $c_i > 0$, country 1 wins any war with
probability $\pi \in (0,1)$, and the winner of the war can impose its most
preferred settlement (x = 1 for country 1 and x = 0 for country 2).
Therefore, $u = \pi - c_1$ and $v = 1 - \pi - c_2$. Let x be the status
quo allocation of the territory.

Given this framework, we can define the set of agreements that
each side will accept in lieu of going to war. For country 1, we require
that $x \geq \pi - c_1$ and for country 2 we require
$1 - x \geq 1 - \pi - c_2$. Therefore, any allocation
$x \in [\pi - c_1, \pi + c_2]$ avoids conflict. Since the costs of war are
positive, the set of peaceful agreements is non-empty. Therefore, under
perfect information, we should expect that one of these agreements will
be reached and war will be avoided.¹⁴
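
The set of peaceful agreements is easy to compute for given parameters. The following Python sketch is a minimal illustration; the parameter values and function names are hypothetical.

```python
def bargaining_range(pi, c1, c2):
    """Interval of settlements x that both countries weakly prefer to war."""
    return pi - c1, pi + c2


def prefers_settlement(x, pi, c1, c2):
    """True if neither country gains by fighting instead of accepting x."""
    low, high = bargaining_range(pi, c1, c2)
    return low <= x <= high


if __name__ == "__main__":
    pi, c1, c2 = 0.6, 0.1, 0.2  # illustrative values
    print(bargaining_range(pi, c1, c2))
    print(prefers_settlement(0.65, pi, c1, c2), prefers_settlement(0.9, pi, c1, c2))
```

The width of the interval is the sum of the two costs of war, so it is non-empty whenever both costs are positive. The sketch does not clip the interval to [0, 1]; a feasible agreement also requires that the interval overlap the set of possible divisions.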

Fearon considers a simple model with incomplete information about
country 1's costs. While he assumes a continuous distribution of $c_1$,
it suffices for us to consider a case where $c_1$ can take on only two
values, $\bar{c} > \underline{c}$. The common knowledge prior is that
$c_1 = \underline{c}$ with probability $\lambda$. Fearon first considers
a model where country 2 makes a single take-it-or-leave-it offer to
country 1. If country 1 rejects it, war ensues.


¹³Banks also shows that U(t; x, p) is continuous in t, but we refer the
reader to his article.

¹⁴This statement of course assumes that the territory is infinitely
divisible so that an agreement in this region is feasible. This may not
be the case in some disputes, especially when the territory has religious
or ideational significance.
In analyzing this model, note that if country 2 offers
$x \geq \pi - \underline{c}$, both country 1 types will accept and war
will be avoided. Clearly, country 2 has no incentive to offer more than
$\pi - \underline{c}$, so let $\bar{x} = \pi - \underline{c}$. If country
2 offers $x \in [\pi - \bar{c}, \pi - \underline{c})$, only the high cost
type will accept it, so that war starts with probability $\lambda$. Of
these offers, country 2 prefers $\underline{x} = \pi - \bar{c}$. Finally,
if country 2 offers $x < \pi - \bar{c}$, both types reject and a war
starts with certainty. This generates a payoff to country 2 of
$v = 1 - \pi - c_2$. Thus, country 2's choice boils down to a choice
among three utilities: $1 - \bar{x}$,
$\lambda v + (1-\lambda)(1 - \underline{x})$, or v. We can easily dismiss
the third option. Since the interval $[\pi - \underline{c}, \pi + c_2]$
is non-empty, we know that $1 - \bar{x} > v$. Thus, country 2 will never
choose to sabotage the negotiations to generate a war with probability 1.
Now we can determine country 2's preferences over the remaining offers.
Clearly, $\lambda v + (1-\lambda)(1 - \underline{x}) > 1 - \bar{x}$
whenever


$$\lambda < \frac{\bar{c} - \underline{c}}{c_2 + \bar{c}}$$

Thus, when the probability that country 1 has low costs is sufficiently
low, country 2 will take an aggressive bargaining stance that risks war
in the event that country 1 actually does have low costs. Note that
the critical threshold is decreasing in country 2's costs since it is less
willing to take such a risk when its military capabilities are low.
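
Country 2's choice between the safe and the risky offer can be sketched in a few lines of Python. This is an illustration under the two-type assumptions above, with `lam` standing for the prior probability that country 1 has low costs; the parameter values and function names are hypothetical.

```python
def lambda_threshold(c_low, c_high, c2):
    """Prior probability of the low-cost type below which country 2 risks war."""
    return (c_high - c_low) / (c2 + c_high)


def country2_offer(lam, pi, c_low, c_high, c2):
    """Return country 2's preferred offer as a (label, x) pair."""
    x_safe = pi - c_low    # accepted by both types
    x_risky = pi - c_high  # accepted only by the high-cost type
    v = 1 - pi - c2        # country 2's war payoff
    eu_safe = 1 - x_safe
    eu_risky = lam * v + (1 - lam) * (1 - x_risky)
    return ("risky", x_risky) if eu_risky > eu_safe else ("safe", x_safe)


if __name__ == "__main__":
    pi, c_low, c_high, c2 = 0.5, 0.1, 0.3, 0.2  # illustrative values
    print(lambda_threshold(c_low, c_high, c2))
    for lam in (0.2, 0.6):
        print(lam, country2_offer(lam, pi, c_low, c_high, c2))
```

Comparing the two expected utilities directly reproduces the threshold: the risky offer is chosen exactly when `lam` is below `lambda_threshold`.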
We can easily check that Banks's results hold trivially for this model.
When $\lambda \geq \frac{\bar{c} - \underline{c}}{c_2 + \bar{c}}$, both
types get the same allocation and have zero probability of going to war.
When $\lambda < \frac{\bar{c} - \underline{c}}{c_2 + \bar{c}}$, both
types are still offered the same allocation but type $\underline{c}$
goes to war with probability one.

As Fearon notes, there are reasons to be skeptical of informational
explanations for war. Perhaps opportunities for communication should
resolve such informational asymmetries and avoid war. However, given
that the countries have diametrically opposed preferences over the
allocations, it is easy to show that cheap talk will not influence
bargaining or the probability of war. Now let country 1 announce H or L
as a signal of its costs $\bar{c}$ and $\underline{c}$, respectively.¹⁵
Following the message, let $\lambda^*$ be country 2's updated belief
about 1's costs. Clearly, based on these updated beliefs, country 2 will
use the same cutpoint rule as before. First we consider whether there are
separating equilibria where type $\bar{c}$ reports H and type
$\underline{c}$ reports L. In such a case, $\lambda^*(H) = 0$,
$\lambda^*(L) = 1$, $x(H) = \underline{x}$, and $x(L) = \bar{x}$. The
probability of war would be zero. Therefore, separating messages require
the incentive compatibility conditions $x(H) \geq x(L)$ and
$x(L) \geq x(H)$. These conditions clearly fail since
$\bar{x} > \underline{x}$. We leave it to the reader to verify that there
are no partially informative semi-pooling equilibria.


¹⁵Neither the restriction to two messages nor imbuing them with literal
meaning is consequential.


6.2. A Model of Escalation*. This section is based on Fearon (1994),
who develops a version of the war of attrition to explore how "audience
costs" imposed on states who back down in international disputes
affect the dynamics of crisis escalation.

Assume that two states, 1 and 2, are in a dispute over a prize worth
v > 0. The game is played in continuous time beginning at t = 0.
At every instant each state can choose among three strategies: attack,
quit, or escalate. The game continues until one or both of the states
quits or attacks. If both states choose escalate, the game continues.
If either state attacks before the other quits, they both receive their
expected payoffs from war, $w_i < 0$. Fearon interprets these payoffs as
resolve since the state with the higher $w_i$ is relatively more willing
to engage in military conflict to settle the dispute.

Fearon's interest is in understanding how the sanctions imposed on
leaders who back down during disputes affect crisis behavior. Therefore,
he assumes that if state i quits before state j at time t, it suffers
audience costs $a_i(t)$ which are strictly increasing in t. The
dependence on t is designed to reflect the intuition that it is more
costly to back down during a protracted dispute than in a short one.

A pure strategy in this game specifies for any subgame beginning
at time t' a rule specifying a finite time $t \geq t'$ at which to attack
or quit.¹⁶ We can write these strategies as {t, attack} meaning "escalate
until t and then attack" or {t, quit} to represent "escalate until time t
and then quit."

Before turning to the more general model where each side is
uncertain of the other's resolve, it is instructive to consider the case
of complete information. For each state, we can compute the time after
which it strictly prefers to attack rather than back down. Clearly, this
occurs when $w_i > -a_i(t)$. Let


$$\bar{t}_i = a_i^{-1}(-w_i)$$

Suppose, either because state 1's resolve or audience costs are higher,
that $\bar{t}_1 < \bar{t}_2$. Thus, at $\bar{t}_1$, state 2 still prefers
to quit rather than be attacked, and so it will quit. Let $Q_i(t;t')$ be
the probability that state i quits before time t conditional on it not
quitting before t' and $Q_i(t)$ be the unconditional probability of
quitting by time t.


¹⁶Not allowing "never quit" to be part of the strategy set eliminates the
uninteresting equilibrium where both sides choose this strategy and
escalate forever.

Thus, at every subgame $0 \leq t' < \bar{t}_1$, state 2 receives
$-a_2(t')$ for quitting immediately and
$Q_1(t;t')v - (1 - Q_1(t;t'))a_2(t)$ for {t, quit}. However, consider
state 1's strategy. From subgame $0 \leq t' < \bar{t}_1$, strategy
{$\bar{t}_1$, attack} has a payoff of v while {t, quit} has a payoff of
$Q_2(t;t')v - (1 - Q_2(t;t'))a_1(t) < v$. Therefore, $Q_1(t;t') = 0$ for
all $t < \bar{t}_1$. Now we can see that state 2's payoff from {t, quit}
is $-a_2(t) < -a_2(t')$. Thus, state 2 will quit immediately at every
subgame $0 \leq t' < \bar{t}_1$, including 0. The equilibrium with
complete information therefore involves state 2 stopping immediately and
state 1 claiming the prize.
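
The complete information outcome depends only on which state first becomes willing to attack. The Python sketch below assumes linear audience costs a(t) = rate * t, which is an illustrative functional form not taken from the text; the function names are likewise hypothetical.

```python
def attack_time(w, rate):
    """Time at which a state with war payoff w < 0 becomes willing to attack.

    Assumes linear audience costs a(t) = rate * t (an illustrative choice),
    so that a(t) = -w at t = -w / rate.
    """
    return -w / rate


def complete_information_winner(w1, rate1, w2, rate2):
    """Return the state (1 or 2) that claims the prize, or None in a tie."""
    t1, t2 = attack_time(w1, rate1), attack_time(w2, rate2)
    if t1 == t2:
        return None
    return 1 if t1 < t2 else 2


if __name__ == "__main__":
    # State 1 is more resolved (w closer to zero) and faces steeper audience costs.
    print(complete_information_winner(-1.0, 2.0, -2.0, 1.0))
```

Either higher resolve or steeper audience costs lowers a state's attack time, and the state with the lower attack time wins the prize without any escalation.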

The complete information equilibrium has the property that both
high resolve and audience costs lead to better crisis bargaining
outcomes. However, it has the unsettling prediction that no crises ever
occur, because the weaker side capitulates immediately. Therefore,
Fearon also considers an incomplete information version of the game
where each side is uncertain of the other's resolve. We assume that the
resolve of state i is distributed according to the cumulative
distribution function $F_i$ on the interval $[\underline{w}_i, 0]$.

Just as in the complete information game, the equilibrium depends
on defining a time point after which neither state will wish to quit.
Fearon refers to such a time point as the horizon of the crisis game.
Formally, we define this horizon point as $t_h$, the earliest time point
such that $Q_i(t)$ is not increasing for $t > t_h$ for i = 1, 2.

Fearon observes that the following must be true in any PBE¹⁷:


(1) Both states quit simultaneously with probability zero. Suppose
state 1 quits with positive probability mass at t'. Then clearly
state 2 has an incentive to wait until at least $t' + \epsilon$
before quitting as this increases its probability of winning v
substantially with only an infinitesimal increase in its audience
costs. Similarly, there are no PBE where a state quits
contemporaneously with an attack from the other state. Again, if
state 2 expects that state 1 will quit with probability mass at
point t', it should hold off its attack until $t' + \epsilon$.
This implies that both states cannot plan to quit at $t_h$.

(2) State i will not attack at time t' if $Q_j(t)$ is increasing
at t'. Since attacks have negative expected utility, it pays to
wait longer in the hopes that the opponent will drop out prior
to the attack.

(3) Both states will quit with positive probability in time intervals
arbitrarily close to $t_h$. By the definition of $t_h$, at least one
state must quit with positive probability in the arbitrarily small
interval before $t_h$. Suppose that this is true for state 1. Now
suppose to the contrary that state 2 quits with zero probability
after time $t' < t_h$. Clearly from observation 2, state 2 will not
attack between t' and $t_h$. Therefore, by quitting with positive
probability after t', state 1 unnecessarily increases its audience
costs: it knows it will quit before state 2 yet it keeps
escalating.

(4) Both states attack with probability zero for $t < t_h$. This
follows directly from observations 2 and 3. Therefore, the
utility of strategy {$t \geq t_h$, attack} for state i is


$$U_i^a(t, w_i) = Q_j(t_h)v + (1 - Q_j(t_h))w_i \tag{10.8}$$

while the utility of {$t < t_h$, quit} is

$$U_i^q(t) = Q_j(t)v - (1 - Q_j(t))a_i(t) \tag{10.9}$$


Since $U_i^q(t)$ does not depend on $w_i$, {$t < t_h$, quit} can only
be a best response if $U_i^q(t)$ is constant for all t, say $k_i$.
Suppose this were not true and let $U_i^q(t') > U_i^q(t)$ for some
$t' < t_h$ and all $t < t_h$. Then all types such that
$U_i^q(t') > U_i^a(t_h, w_i)$ would quit exactly at t'. State j's
best response would then be {t > t', quit} making
$U_i^q(t') = -a_i(t')$, which contradicts $U_i^q(t') > U_i^q(t)$ for
all $t < t_h$.


¹⁷For formal statements of these observations and their proofs, see
Fearon 1994.

Based on these observations, the following lemmas help to
characterize the perfect Bayesian equilibrium for this game.


LEMMA 10.4. In any equilibrium in which both states choose to
escalate with positive probability, there must exist a finite horizon
$t_h$.


The logic of this lemma is straightforward. Suppose to the contrary
that there were a PBE where $Q_j(t)$ were increasing for all t. By
observation 2, state i will not attack with probability 1, which in turn
(by observation 4) suggests that $U_i^q(t)$ is constant for all t. This
implies that

$$Q_j(t) = \frac{k_i + a_i(t)}{v + a_i(t)}$$

However, since j never attacks, it must be the case that
$\lim_{t \to \infty} Q_j(t) = 1$. This can only be true if $k_i = v$ or
$\lim_{t \to \infty} a_i(t) = \infty$. If $k_i = v$, then $Q_j(0) = 1$,
implying that j does not escalate with probability 1. If
$\lim_{t \to \infty} a_i(t) = \infty$, i will not wish to choose
{t, quit} for arbitrarily large t.


LEMMA 10.5. In any equilibrium with $t_h$ as the horizon and in which
escalation may occur, (1) if state $i$ chooses $\{t, \text{attack}\}$ it must be the
case that $t \ge t_h$; and (2) state $i$ will choose $\{t, \text{attack}\}$ where $t \ge t_h$ if
$w_i > -a_i(t_h)$ and only if $w_i \ge -a_i(t_h)$.

Part 1 of this lemma follows directly from observations 2 and 3.
Ignoring several technical complications, part 2 follows from the fact
that $U_i^a(t, w_i) > U_i^q(t)$ if and only if $w_i > -a_i(t_h)$.^18

From Lemma 10.5 we know that the ex ante probability that state $j$
will attack at $t_h$ is $1 - F_j(-a_j(t_h))$. Thus, state $i$'s ex ante utility of
escalating up to $t_h$ and then backing down is

$u_i(t_h) = F_j(-a_j(t_h))\,v - (1 - F_j(-a_j(t_h)))\,a_i(t_h)$

We can then define $t_i^*$ such that $u_i(t_i^*) = 0$. Thus, $t_i^*$ has the property
that state $i$ is indifferent between escalating to time $t_i^*$ and conceding
immediately.^19


PROPOSITION 10.4. Let $t_i^*$ be the unique solution to $u_i(t_i^*) = 0$ and
$t^* = \min\{t_1^*, t_2^*\}$. For any equilibrium in which escalation occurs with
positive probability, the horizon must be $t^*$.


If $t_h$ were greater than $t^*$, the state with the lower $t_i^*$ would have an
incentive to quit with probability 1 before $t_h$. This would contradict
observation 3. If $t_h$ were less than $t^*$, then both states would
have an incentive to bluff a little longer at $t_h$ before quitting. This
contradicts the definition of $t_h$.
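The construction of the horizon can be made concrete with a small numerical sketch. Everything specific here is an illustrative assumption rather than part of the model in the text: audience costs are taken to be linear, $a_i(t) = \alpha_i t$, resolve $w_j$ is taken to be uniform on $[-W, 0]$, and the names `F_uniform`, `u`, and `t_star_i` are hypothetical. Under those assumptions $u_i(t)$ is strictly decreasing, so bisection finds the unique root.

```python
# Illustrative assumptions (not from the text): linear audience costs
# a_i(t) = alpha_i * t and resolve w_j uniform on [-W, 0].

def F_uniform(x, W):
    """CDF of a uniform distribution on [-W, 0]."""
    return min(1.0, max(0.0, (x + W) / W))

def u(t, alpha_i, alpha_j, v, W):
    """Ex ante utility to state i of escalating to horizon t and then quitting."""
    p_j_quits = F_uniform(-alpha_j * t, W)          # F_j(-a_j(t))
    return p_j_quits * v - (1.0 - p_j_quits) * alpha_i * t

def t_star_i(alpha_i, alpha_j, v, W, tol=1e-10):
    """Bisection for the root of u_i(t) = 0 on [0, W / alpha_j]."""
    lo, hi = 0.0, W / alpha_j                        # u > 0 at lo, u < 0 at hi
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if u(mid, alpha_i, alpha_j, v, W) > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

v, W = 1.0, 2.0
alpha = {1: 1.0, 2: 0.5}
t1 = t_star_i(alpha[1], alpha[2], v, W)              # state 1's indifference time
t2 = t_star_i(alpha[2], alpha[1], v, W)              # state 2's indifference time
t_star = min(t1, t2)                                 # candidate horizon
```

With these particular numbers the state with the smaller audience cost slope has the smaller indifference time, so it pins down the horizon; other parameter choices need not behave the same way.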

These lemmas lead directly to the main result. 

PROPOSITION 10.5. Label the players so that $t^* = t_2^* \le t_1^*$. Let
$k_1 = u_1(t^*) \ge 0$. The following describes equilibrium strategies for
state $i = 1, 2$ as a function of type $w_i$:

For $w_i \ge -a_i(t^*)$, state $i$ plays $\{t, \text{attack}\}$ for any $t \ge t^*$.

For $w_i < -a_i(t^*)$, state $i$ plays $\{t, \text{quit}\}$ with any pure strategies that
yield the following cumulative distributions

$\Psi_1(t) = \frac{1}{F_1(-a_1(t^*))} \cdot \frac{a_2(t)}{v + a_2(t)}$

$\Psi_2(t) = \frac{1}{F_2(-a_2(t^*))} \cdot \frac{k_1 + a_1(t)}{v + a_1(t)}$

For $t < t^*$, state $i$ believes that the probability that $j$ will not back down
is given by

$\Pr(w_j \ge -a_j(t^*) \mid t) = \frac{(1 - F_j(-a_j(t^*)))\,(v + a_i(t))}{v - u_i(t^*)}$

For $t \ge t^*$, state $i$'s beliefs follow Bayes' Rule in accord with the
opponent's strategy for attacking. For any $t > t^*$ off the equilibrium
path, let $i$ believe that $w_j \ge -a_j(t^*)$ and is distributed according to $F_j$
truncated at $-a_j(t^*)$.

18 The technical complications involve ruling out situations where $Q_i(t)$ has mass
points.

19 We implicitly assume that the range of $a_i(t)$ is sufficiently large that there is
a unique solution to $u_i(t_i^*) = 0$.


PROOF. Let $Q_i(t)$ be the unconditional probability that state $i$
quits by time $t$. From Lemma 10.5 and Proposition 10.4, $Q_i(t^*) = F_i(-a_i(t^*))$.
The utility to state $i$ of $\{t, \text{quit}\}$ is therefore

$Q_j(t)\,v - (1 - Q_j(t))\,a_i(t)$

To ensure that $i$ is indifferent between quitting and continuing for any
$t < t^*$, we require that $Q_j(t)\,v - (1 - Q_j(t))\,a_i(t) = u_i(t^*)$ or

$Q_j(t) = \frac{u_i(t^*) + a_i(t)}{v + a_i(t)}$

Since only types $w_j < -a_j(t^*)$ ever quit, these types must quit at
rates $\Psi_j(t) = Q_j(t) / F_j(-a_j(t^*))$ or

$\Psi_j(t) = \frac{1}{F_j(-a_j(t^*))} \cdot \frac{u_i(t^*) + a_i(t)}{v + a_i(t)}$

We know that by time $t$, a fraction $\Psi_j(t)$ of the types in the interval
$[\underline{w}_j, -a_j(t^*))$ will have dropped out so that

$\Pr(w_j \ge -a_j(t^*) \mid t) = \frac{1 - F_j(-a_j(t^*))}{1 - F_j(-a_j(t^*)) + F_j(-a_j(t^*))\,(1 - \Psi_j(t))}$

$= \frac{1 - F_j(-a_j(t^*))}{1 - Q_j(t)} = \frac{(1 - F_j(-a_j(t^*)))\,(v + a_i(t))}{v - u_i(t^*)}$

□


In this PBE, types with low resolve from state $i$ drop out at a
rate designed to keep the low resolve types of state $j$ indifferent between
dropping out and escalating through time $t^*$. At time $t^*$, both states
attack because they know that all of the low resolve types for the other
state would have dropped out by then and escalating would simply lead
to larger audience costs.
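The indifference condition behind the quit rates can be checked numerically. The sketch below assumes a linear audience cost function and arbitrary values for the prize $v$ and the equilibrium payoff $u_i(t^*)$; these numbers and the function names are hypothetical and chosen only for illustration. It confirms that when $Q_j(t) = (u_i(t^*) + a_i(t)) / (v + a_i(t))$, state $i$'s payoff from quitting is the same at every $t$.

```python
def quit_cdf(t, k_i, v, a_i):
    """Q_j(t): probability that state j has quit by t, chosen so that
    state i's payoff from quitting at t equals k_i."""
    return (k_i + a_i(t)) / (v + a_i(t))

def quit_payoff(t, q_j, v, a_i):
    """State i's payoff from {t, quit} when Q_j(t) = q_j."""
    return q_j * v - (1.0 - q_j) * a_i(t)

a_i = lambda t: 0.5 * t        # assumed audience cost function
v, k_i = 1.0, 0.25             # assumed prize and equilibrium payoff u_i(t*)

times = (0.0, 0.5, 1.0, 1.5)
payoffs = [quit_payoff(t, quit_cdf(t, k_i, v, a_i), v, a_i) for t in times]
cdf_values = [quit_cdf(t, k_i, v, a_i) for t in times]
```

Every entry of `payoffs` equals `k_i` up to rounding, while `cdf_values` rises with `t`: the opponent must quit at just the pace that offsets the growing audience cost.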

It is instructive to explore why low resolve types of state $i$ quit at a
rate to make low resolve types of state $j$ indifferent between dropping
out and escalating. If they dropped out at a faster rate, all low-resolve
types in state $j$ would conclude at each $t$ that state $i$ is more likely to
be a strong type. This would lead low-resolve types of state $j$ to quit
more quickly. State $i$ would then begin to infer that the pool
of state $j$ types is stronger, and begin to drop out more quickly. In the
limit, all low-resolve types would quit at $t = 0$, but this cannot be an
equilibrium (recall Observation 1).

Now consider what happens if low resolve types from state $i$ drop
out at a slower rate than the equilibrium. Then at each $t$, state $j$ would
infer that the pool of remaining types is weaker than the corresponding
equilibrium pool. This would lead state $j$ types to also drop out at a
slower rate, which in turn induces state $i$ to escalate more, and so on.
Such a dynamic would lead all types to prefer escalating until $t^*$.
However, this cannot be an equilibrium because the low resolve types
would have a clear preference for dropping out before $t^*$ rather than attacking
at $t^*$.

An important substantive feature of the model is that it turns out 
to be beneficial to be able to incur large audience costs. Such a state 
is better able to convince its opponent that it is “locked-in” to the 
conflict since a larger set of types are willing to escalate to the horizon 
and then attack. This runs counter to a simple intuitive prediction that 
those with the most to lose by backing down will back down earlier. 
However, conditional on starting a crisis, the signaling value outweighs this
effect. By modifying his model slightly to include an explicit initiation 
phase, Fearon argues that this framework provides a justification for 
why democracies (highly sensitive to audience costs) might be less likely 
to initiate conflict, but will be more likely to prevail. 


7. Exercises 


EXERCISE 10.1. Let $U_i(x_i) = \ln x_i$ for $i \in \{A, B\}$. Solve for the
Nash bargaining solution as a function of the disagreement values.


EXERCISE 10.2. In the Rubinstein bargaining model with $\delta_1 = \delta_2$
and $d_1 = d_2 = 0$, assume that $u_i(x_i) = x_i^a$ where $0 < a < 1$. Compute
the SPNE shares. What is the effect of risk-aversion (lower $a$)?


EXERCISE 10.3. Consider the Baron-Ferejohn model where u;(x;) = 


®*. Show that the initial proposer’s share is decreasing in a. 


aes 


EXERCISE 10.4. In the model described in Section 8.2 with N = 3 
and m = 1, suppose that — GR :. Compute a mixed strategy
equilibrium where $v_A = v_B$.

EXERCISE 10.5. In the model described in Section 3.2, compute $v_A$
and $v_B$ for generic values of $N$ and $n$.


EXERCISE 10.6. Consider an extension of the model described in
Section 4.1. First assume that there are two rounds of bargaining so that
A makes a counteroffer if it rejects B's initial offer. Assume that
the payoffs are discounted by a factor $\delta$ if agreement is reached in the
second round. What is the PBE to this game? Now assume that there
is incomplete information about B's disagreement value where $u_B = 0$
with probability $\pi$ and $u_B = d$ with probability $1 - \pi$. Construct a
PBE to this game. Is it unique?


EXERCISE 10.7. Consider an extension of the model considered in
Section 3.3. Assume that there are two groups of bargainers, 1 and
2. Let $m_i$ be the number of members of group $i$ so that $m_1 + m_2 = N$.
Suppose that all members of group $i$ have a qualified veto power in that
if they object to the proposal $k_i$ votes are required to override their veto
where $0 < k_i < N$. Assume that $k_1 > k_2$. Compute continuation
values for members of each group.


EXERCISE 10.8. In the model described in Section 6.1, assume that 
$c_1$ is distributed uniformly on the interval $[\underline{c}, \bar{c}]$. Compute the perfect
Bayesian equilibrium. Now consider the extension with pre-bargaining 
cheap talk. Show that if €, there is a PBE where the high cost types 
reveal information about their cost to country 2. 


CHAPTER 11 


Mechanism Design and Agency Theory 


So far we have discussed techniques for analyzing how strategic 
agents behave in specific games. In social settings where the “rules” 
are fairly clear, this approach to game theory is a powerful source of empirical
predictions about the outcomes of strategic interactions. 

An alternative approach is to ask a slightly different question: given 
a desired outcome, what game should be designed so that strategic 
agents will produce it? The field of game theory that asks such questions
is called mechanism design. Here a designer or principal selects
a Bayesian game, or mechanism, for an agent or group of agents to 
play. Examples include the selection of tax codes to induce agents to 
reveal their willingness to fund public projects, the design of auctions 
to maximize revenue, and selection of reelection functions by voters to 
create incentives for government officials to “behave” while in office. 

Typically, we model the choice of mechanisms as a maximization 
problem given the designer’s preferences over agent types and out- 
comes. If the designer’s preferences can be interpreted as an objective 
notion of social preferences then the problem of selecting a mechanism 
can be viewed as a normative exercise. A classic example is the se- 
lection of rules to determine the provision of public goods so as to 
maximize the sum of individual utilities. Given this normative inter- 
pretation, mechanism design is closely related to social choice theory. 
A version of mechanism design known as implementation theory seeks 
to uncover classes of choice rules (mappings from agent types to collec- 
tive decisions) for which there exists a mechanism which will achieve 
the choice function. Choice functions of this type are said to be implementable.
The Gibbard-Satterthwaite theorem is an example of this
type of work.

While the applications of mechanism design are often normative or 
prescriptive, we also may use it to make positive predictions. For ex- 
ample, mechanism design is often used to investigate whether a poorly 
informed principal (e.g. legislature or executive) can create incentives 
so that well informed agents (e.g. committees or bureaucrats) take 
actions to achieve ends which the principal desires.

In most applications in economics, it is assumed that the designer
has a very rich set of games to choose from. She is usually free to commit
to contracts with very elaborate reward and punishment schemes.
Such assumptions seem entirely reasonable in economic settings where
third parties such as courts may be counted on to enforce complex
agreements and large monetary rewards and sanctions are considered
legitimate. However, in applications to politics, it is often unreasonable
to believe that principals can pre-commit to reward schemes because
third-party enforcement is often unavailable. Also, monetary
incentives may be legally or socially proscribed. Accordingly, after
presenting some basic concepts and results of implementation theory and
mechanism design, we will focus much of our attention on the design
of incentive mechanisms when the principal is more constrained.

To get a feel for mechanism design, let's consider one of the classic
examples: a political science department of $n$ members and a chair that
is deciding whether to purchase a nice Saeco espresso maker. The fancy
coffee maker has a cost $c$ and the chair wants to see if the department
members value the machine enough to justify the expense. Assume
that each member's valuation of the coffee maker is $\theta_i \in \mathbb{R}_+$. The chair,
a Benthamite and non-coffee drinker,^1 wishes to purchase the machine
if and only if $\sum_{i=1}^{n} \theta_i \ge c$. Unfortunately, the chair does not know the
valuations of individual department members. Instead she believes
that each member's type is drawn from the probability distribution
$F(\cdot)$. What should the chair do? One solution might be to privately
ask each member for his valuation and purchase the espresso maker
with department funds if the total announced valuations exceed $c$. The
problem is that some colleagues might find it advantageous to inflate
their valuations in order to increase the likelihood that the machine
is purchased. Thus, this scheme induces a game where each faculty
member's best response is to report a valuation that exceeds his true
preference. So this solution may not do a very good job in determining
whether the espresso maker should be purchased. The chair may decide
that a better solution is to ask each member to contribute her valuation
and then, if the total contributions exceed $c$, purchase the maker and
keep the surplus to pay for coffee beans. If the contributions do not
reach $c$, the chair would return them. This mechanism is also flawed.
Now the strategic scholars may decide to understate their valuations,
hoping to free-ride off the contributions of their colleagues. This rule
creates the classic collective action problem.

1 Jeremy Bentham (1748-1832) was a prominent British philosopher who
advocated an ethical system based on pursuing the “greatest good for the greatest
number.” Historians record that he was also a coffee drinker.

While neither of the preceding schemes works well, we can use the 
theory of mechanism design to uncover a class of particularly simple 
mechanisms that can be used by the chair to learn the faculty’s pref- 
erences. Groves (1973) and Clarke (1971) show that the following 
mechanism has good properties. 


• Ask each faculty member to e-mail her valuation $m_i$ to the
chair.

• If $\sum_{i=1}^{n} m_i \ge c$, purchase the coffee maker; otherwise do not.

• If the coffee maker is purchased, collect from faculty member $i$
the amount $t_i(m_i, m_{-i}) = c - \sum_{j \ne i} m_j$.

• If the coffee maker is not purchased, collect no money.


We can demonstrate that given this mechanism each faculty member
has an incentive to reveal her true valuation regardless of the other
members' valuations. A key property of this mechanism is that member
$i$'s message only indirectly affects her contribution through its effect
on the ultimate decision about whether to purchase the espresso maker.
The amount that each member pays depends on the messages of all the
other members.

To see that all members will offer truthful messages, consider the
decision of member $i$ with type $\theta_i$. Suppose that she were to lie
with the announcement $m_i' < \theta_i$. This understatement affects the
outcome only if it changes whether the machine is purchased, that is,
if $\sum_{j \ne i} m_j + m_i' < c \le \sum_{j \ne i} m_j + \theta_i$. Under these circumstances,
$c - \sum_{j \ne i} m_j \le \theta_i$ so that the contribution required from a truthful
announcement, $c - \sum_{j \ne i} m_j$, is no more than the member's value of the new
espresso maker. This deviation from a truthful response can only make
the department member worse off. Now consider whether a member
has an incentive to overstate her demand for espresso with a message
$m_i' > \theta_i$. This fabrication only affects $i$'s utility if the inflated message
results in purchasing the machine where the truthful message would
not have. Such a scenario requires that $\sum_{j \ne i} m_j + m_i' \ge c > \sum_{j \ne i} m_j + \theta_i$
so that $c - \sum_{j \ne i} m_j > \theta_i$. Thus, member $i$'s contribution is more than
her value of the coffee maker. So lying doesn't pay.
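The argument can also be checked by brute force on a small grid. The sketch below is only an illustration: the grid of valuations, the cost, the three-member department, and the function names are all assumptions made for the example. It encodes the purchase rule and the transfer $c - \sum_{j \ne i} m_j$, then confirms that no misreport ever beats the truth for member 0, whatever the others announce.

```python
from itertools import product

def outcome(messages, cost):
    """Return (purchased, transfers) under the purchase and payment rule."""
    total = sum(messages)
    if total >= cost:
        # Member i pays the cost minus the sum of everyone else's messages.
        return True, [cost - (total - m) for m in messages]
    return False, [0.0] * len(messages)

def utility(i, theta_i, messages, cost):
    """Member i's payoff: valuation minus transfer if purchased, else zero."""
    purchased, transfers = outcome(messages, cost)
    return theta_i - transfers[i] if purchased else 0.0

def truthful_is_dominant(theta_i, grid, n_others, cost):
    """Check on a grid that reporting theta_i is never beaten by a lie."""
    for others in product(grid, repeat=n_others):
        honest = utility(0, theta_i, [theta_i, *others], cost)
        for lie in grid:
            if utility(0, theta_i, [lie, *others], cost) > honest + 1e-12:
                return False
    return True

grid = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]   # hypothetical valuations and messages
cost = 6.0                               # hypothetical price of the machine
all_ok = all(truthful_is_dominant(th, grid, n_others=2, cost=cost) for th in grid)
```

A grid search is not a proof, but it exercises exactly the two cases in the argument: a lie matters only when it flips the purchase decision, and in those cases it flips it the wrong way for the liar.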

Beyond promoting honesty in departmental affairs, the Groves-Clarke
mechanism has the desirable property that the espresso maker
is purchased if and only if the aggregate valuation of the department
exceeds its cost. However, it has a less desirable property that it is not
“budget balancing.” When the machine is purchased, the chair collects
$\sum_{i=1}^{n} t_i(m_i, m_{-i}) = nc - \sum_{i=1}^{n} \sum_{j \ne i} m_j = nc - (n-1) \sum_{j=1}^{n} m_j$,
which is less than or equal to $c$ because $\sum_{j} m_j \ge c$ whenever the machine
is purchased. The chair must therefore make up any shortfall from
department funds, on top of having to send and read all those e-mail
messages.


1. The Mechanism Design Problem 


Now we consider the mechanism design problem in a more abstract
setting. Consider a set $N$ of $n$ agents and a mechanism designer
(denoted agent 0). The designer ultimately selects a policy $x \in X$.
Each agent has a private type $\theta_i \in \Theta_i$, with the joint type vector $\theta$
drawn from the joint distribution function $F(\theta)$. Agents also have
Bernoulli utility functions $u_i(x, \theta) : X \times \Theta \to \mathbb{R}^1$ that depend on
the chosen policy and the agents' types. The mechanism designer has
a Bernoulli utility function $u_0(x, \theta)$. In many applications agents care
only about their own type, but we allow agents' payoffs to be a function
of the entire profile. The primitives of a mechanism design problem
are therefore $(\Theta, F(\theta), X, u)$.

In a typical application, the mechanism designer will elicit a public
vector of signals from the agents. The designer chooses a message space
$M_i$ for each agent and a policy function $p(m) : \prod_{i \in N} M_i \to X$ that selects
a policy for every possible profile of messages $m = (m_1, m_2, \ldots, m_n) \in
M = \prod_{i \in N} M_i$. Accordingly a mechanism is a pair $(M, p(\cdot))$.

For a given choice of message spaces and policy function, the $n$
agents play the Bayesian normal form game with the strategy sets
$S_i = M_i$ and payoffs given by the composition of $u$ and $p$, $u_i(p(m), \theta)$.
It is straightforward to see how the espresso mechanism maps into
this general framework. Clearly, the chair is the designer who selects
the message space $M = \Theta$ and implements the policy “buy if $\sum_{i=1}^{n} m_i \ge c$
and charge $c - \sum_{j \ne i} m_j$ to agent $i$.” The faculty members then play a
Bayesian normal form game with strategies and payoffs determined by
the chair's choices.

Given a mechanism, determining how agents will behave requires 
specifying a form of rationality. One possibility is to make predictions 
only if the agents have dominant strategies in the induced game. These 
are known as dominance solvable mechanisms. Alternatively, we could 
assume that agents select a Bayesian Nash equilibrium. Clearly, the 
question of how well the mechanism performs rests on assumptions 
about how the agents play the induced game. 

In this chapter we will focus on Bayesian Nash equilibria. A large
literature exists in economics using other solution concepts. For example,
the Groves-Clarke mechanism originated in the literature on
implementation in dominant strategies. Given the focus on Bayesian Nash
equilibria, we wish to highlight the types of choice functions $g : \Theta \to X$
that satisfy the following condition: there exists a mechanism $(M, p(\cdot))$
such that if agents play a Bayesian Nash equilibrium of the mechanism
then the final outcome corresponds to the policy that would be selected
by the choice function, $p(m(\theta)) = g(\theta)$.

From the mechanism designer's perspective the mechanism is
instrumental to achieving a particular choice function. If the designer
wishes to implement the function $g(\theta)$ in Bayesian Nash strategies then
she must select a mechanism $(M, p(\cdot))$ such that the corresponding
game has a Bayesian Nash equilibrium in which the agents use strategies
$m_i^*(\theta_i)$ so that $p(m_1^*(\theta_1), \ldots, m_i^*(\theta_i), \ldots, m_n^*(\theta_n)) = g(\theta)$. Thus the
choice of a mechanism is informed by knowledge of the incentives the
mechanism creates, and the designer anticipates how these incentives
will shape behavior by anticipating that agents will play equilibrium
strategies to the mechanism.


DEFINITION 11.1. Given a mechanism design problem $(\Theta, F(\theta), X, u)$
we say the choice function $g(\theta)$ is implementable in Bayesian Nash
strategies if there exists a mechanism $(M, p(\cdot))$ which has a Bayesian
Nash equilibrium $m^*(\cdot)$ in which $p(m_1^*(\theta_1), \ldots, m_i^*(\theta_i), \ldots, m_n^*(\theta_n)) = g(\theta)$
for every $\theta \in \Theta$.


If a game is dominance solvable, the surviving strategy profile will 
be a Bayesian Nash equilibrium. This means that given a mechanism 
design problem the set of choice functions which are implementable in 
dominant strategies is a subset of the set of choice functions which are 
implementable in Bayesian Nash strategies.

Our first result, the so-called revelation principle, dramatically
simplifies the search for choice functions which are implementable by
allowing us to focus on a smaller set of possible mechanisms. Direct
mechanisms are mechanisms in which the agents are asked to report
their types directly. Thus, direct mechanisms have $M_i = \Theta_i$. The
revelation principle says that if there exists a mechanism to implement
choice function $g(\theta)$ then there must exist a direct mechanism that
implements $g(\theta)$. This powerful result tells us that we need not consider
all possible mechanisms, just the direct ones.

While the revelation principle is quite general, its proof is very
straightforward. It can also be extended to a very large class of equilibrium
concepts.


PROPOSITION 11.1. (Revelation Principle in Bayesian Nash strategies).
Given a mechanism design problem $(\Theta, F(\theta), X, u)$, if the choice
function $g(\theta)$ is implementable in Bayesian Nash strategies then there
exists a direct mechanism $(\Theta, p(\cdot))$ that implements $g(\theta)$ in Bayesian
Nash strategies.


PROOF. We begin by assuming that there is a non-direct mechanism
$(M, p'(\cdot))$ that implements $g(\theta)$ in Bayesian Nash strategies. We
use this mechanism to construct a direct mechanism that also implements
the choice function $g(\theta)$. Let $s_i(\theta_i)$ denote the strategy that
player $i$ deploys in one of the Bayesian Nash equilibria to the game
induced by $(M, p'(\cdot))$. Consider the direct mechanism in which agents
are asked to announce messages $m_i \in \Theta_i$ and then the policy is chosen
by the function $p(\theta) = p'(s_1(\theta_1), \ldots, s_i(\theta_i), \ldots, s_n(\theta_n))$. We need
only verify that under the direct mechanism, truthful announcements
$m_i(\theta_i) = \theta_i$ form a Bayesian Nash equilibrium. First, suppose that all
agents $N \setminus i$ are playing truthful strategies. If agent $i$ also uses a truthful
strategy then the final outcome will be $g(\theta)$. Now suppose that
there is a desirable deviation $m_i' \ne \theta_i$ in the direct mechanism for agent
$i$ with type $\theta_i \in \Theta_i$. If agent $i$ can select $m_i'$ in the direct mechanism,
it must be the case that $m_i' \in \Theta_i$. However, since $s_i(\cdot) : \Theta_i \to M_i$ in
the game induced by $(M, p'(\cdot))$, it must be the case that $s_i(m_i') \in M_i$
exists, so this message was available to type $\theta_i$ in the original game.
This implies that

$\int_{\Theta_{-i}} u_i(p'(s_1(\theta_1), \ldots, s_i(m_i'), \ldots, s_n(\theta_n)), \theta)\, dF(\theta_{-i} \mid \theta_i) >$

$\int_{\Theta_{-i}} u_i(p'(s_1(\theta_1), \ldots, s_i(\theta_i), \ldots, s_n(\theta_n)), \theta)\, dF(\theta_{-i} \mid \theta_i)$

This expression contradicts the fact that $s_i(\cdot)$ is a best response in the
Bayesian game induced by $(M, p'(\cdot))$. Thus we have established the
result. □
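The construction in the proof is just function composition, which a short sketch can make tangible. The example below is an illustration under stated assumptions, not part of the text's argument: the indirect mechanism is taken to be a two-bidder first price sealed bid auction with values uniform on [0, 1], for which bidding half of one's value is the standard equilibrium strategy; ties go to the higher-indexed bidder; and `make_direct`, `first_price`, and `expected_utility` are hypothetical names.

```python
def make_direct(p_indirect, strategies):
    """Direct mechanism: apply each agent's equilibrium strategy to her
    reported type, then run the original policy function on the messages."""
    def p_direct(reports):
        messages = [s(r) for s, r in zip(strategies, reports)]
        return p_indirect(messages)
    return p_direct

def first_price(bids):
    """Indirect mechanism p': the highest bid wins and pays her own bid.
    Ties go to the highest index (an arbitrary tie rule for this sketch)."""
    winner = max(range(len(bids)), key=lambda i: (bids[i], i))
    return winner, bids[winner]

half = lambda theta: theta / 2.0      # assumed equilibrium bid s_i(theta_i)
direct = make_direct(first_price, [half, half])

def expected_utility(theta, report, grid):
    """Agent 0's average payoff when agent 1 reports truthfully on a grid."""
    total = 0.0
    for theta_1 in grid:
        winner, price = direct([report, theta_1])
        total += (theta - price) if winner == 0 else 0.0
    return total / len(grid)

grid = [k / 200 for k in range(201)]  # discretized type space
theta = 0.6                           # agent 0's true type
best_report = max(grid, key=lambda r: expected_utility(theta, r, grid))
```

On this grid the best report coincides with the true type: misreporting in the direct mechanism amounts to playing another type's equilibrium bid in the original auction, which cannot be better than one's own.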


One cautionary note is in order. We focus only on the existence
of a mechanism which implements a choice function as the outcome
of a Bayesian Nash equilibrium. Clearly, there may be other equilibria
of the Bayesian game induced by the mechanism that result in different
collective choices.

Given the revelation principle, the question of whether a particular
choice function is implementable can be answered by focusing on truth- 
ful direct mechanisms. If a choice function cannot be implemented by 
such a mechanism, it cannot be implemented by any mechanism. 

We now consider a few examples before returning to the develop- 
ment of the theory. 


2. Applications 


2.1. Polling. Suppose that there are $N = \{1, 2, \ldots, n\}$ ($n$ odd, not
surprisingly) voters with symmetric single-peaked preferences on $\mathbb{R}^1$.
The mechanism designer does not know the agents' ideal points but she
may ask each voter a question and then select a policy $x \in \mathbb{R}^1$. Each
ideal point $\theta_i$ is drawn from a distribution $F(\cdot)$ on $\mathbb{R}^1$. The natural
question to ask is whether there are any mechanisms that induce the
agents to reveal their ideal points in dominant strategies. The answer
is yes. Consider the mechanism that asks each agent to announce
their ideal point $m_i \in \mathbb{R}^1$ and then chooses policy equal to the median
announcement, $x(m) = \text{median}(m)$. To see that a truthful response
is a best response, consider a respondent with ideal point $\theta_i$. Let
$x(m_i, m_{-i})$ denote the median of the profile of messages that includes
$m_i$ and the responses of the other $n - 1$ respondents. Further, let
$\underline{x}(m_{-i})$ and $\bar{x}(m_{-i})$ be the lower and upper medians of $m_{-i}$.^2

Suppose that agent $i$ reports $y_i$. If $y_i < \underline{x}(m_{-i})$, then $\underline{x}(m_{-i})$
becomes the median report so that $x(y_i, m_{-i}) = \underline{x}(m_{-i})$. Similarly, if
$y_i > \bar{x}(m_{-i})$, $x(y_i, m_{-i}) = \bar{x}(m_{-i})$. Finally, if $y_i \in [\underline{x}(m_{-i}), \bar{x}(m_{-i})]$,
$y_i$ is the median report so that $x(y_i, m_{-i}) = y_i$. Thus, a report
of $y_i$ can result in only three types of outcomes: $\underline{x}(m_{-i})$, $\bar{x}(m_{-i})$, and
$y_i \in [\underline{x}(m_{-i}), \bar{x}(m_{-i})]$.

Clearly, a deviation from the truth cannot pay in the state of the world where
$\theta_i \in [\underline{x}(m_{-i}), \bar{x}(m_{-i})]$ since she obtains her ideal point by reporting
$m_i = \theta_i$. So suppose that $\theta_i < \underline{x}(m_{-i})$. Now agent $i$'s best feasible
outcome is $\underline{x}(m_{-i})$, which can be obtained by any message less than
$\underline{x}(m_{-i})$, which includes $\theta_i$. Similarly, if $\theta_i > \bar{x}(m_{-i})$, announcing $m_i =
\theta_i$ weakly maximizes her utility. We have thus seen that regardless of
the responses of the other players, agent $i$'s best strategy is to announce
$m_i = \theta_i$.

2 Recall that if a set of real numbers has an even number of distinct elements,
it has two medians.
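A brute-force check on a small grid illustrates the same point. The sketch assumes a specific symmetric single-peaked utility, $-|x - \theta_i|$, three voters, and an integer grid of ideal points and messages; these choices and the function names are illustrative assumptions only.

```python
from itertools import product
from statistics import median

def median_policy(messages):
    """Policy rule: the median announcement (n odd, so it is unique)."""
    return median(messages)

def utility(ideal, policy):
    """Assumed symmetric single-peaked utility around the ideal point."""
    return -abs(policy - ideal)

def truthful_never_beaten(ideal, grid, n_others):
    """True if no misreport on the grid ever does strictly better than the truth."""
    for others in product(grid, repeat=n_others):
        honest = utility(ideal, median_policy([ideal, *others]))
        if any(utility(ideal, median_policy([lie, *others])) > honest for lie in grid):
            return False
    return True

grid = [0, 1, 2, 3, 4]     # hypothetical ideal points and messages
ok = all(truthful_never_beaten(theta, grid, n_others=2) for theta in grid)
```

The check passes because a report can only move the median toward the report's own side of the other voters' medians, never closer to an ideal point than the truthful report already gets.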

Relating this example back to the Gibbard-Satterthwaite Theorem
of chapter 4, Hervé Moulin (1980) obtained a mechanism design analogue
to results about the existence of preference aggregation rules
that do not violate Arrow's conditions when preferences are single-peaked.
Moulin focused on mechanisms in which respondents are asked
to announce a number (interpreted as their ideal point) and set up a
small number of criteria, anonymity and efficiency. The first condition,
anonymity, requires simply that the mechanism treat individuals
identically.


DEFINITION 11.2. A mechanism $g : \mathbb{R}^n \to X$ is anonymous if for
any permutation $\pi : N \to N$, $g(m_1, \ldots, m_i, \ldots, m_n) = g(m_{\pi(1)}, \ldots, m_{\pi(i)}, \ldots, m_{\pi(n)})$.


The efficiency condition is essentially identical to Arrow’s Pareto 
condition. 


DEFINITION 11.3. A mechanism $g : \mathbb{R}^n \to X$ is efficient if for
every profile of types $y = (y_1, \ldots, y_n)$, $g(y)$ is Pareto efficient, that is,
there is not some other policy $z \ne g(y)$ such that every agent weakly
prefers $z$ to $g(y)$ and some agent strictly prefers $z$ to $g(y)$.


Moulin showed the following result. 


PROPOSITION 11.2. If preferences are single-peaked then every efficient,
anonymous, and strategy-proof mechanism is of the following
form: Take the announcements $m_1, m_2, \ldots, m_n$ and add $k$ fixed numbers
$a_1, \ldots, a_k$, and select the median of this longer list, $x(m, a)$.


We have already sketched out the argument for why such a mech- 
anism is strategy-proof. The proof of Moulin’s result is left as an 
exercise. 
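To see what the added fixed numbers do, the following sketch computes the median of the announcements together with a list of fixed values. The function name `generalized_median`, the sample announcements, and the particular fixed numbers are illustrative assumptions; the requirement that the combined list have odd length is imposed here only so that the median is unique.

```python
from statistics import median

def generalized_median(messages, fixed):
    """Median of the announcements together with a list of fixed numbers."""
    combined = list(messages) + list(fixed)
    if len(combined) % 2 == 0:
        raise ValueError("need an odd number of entries for a unique median")
    return median(combined)

messages = [0.2, 0.5, 0.9]                                  # hypothetical announcements

plain = generalized_median(messages, fixed=[])              # ordinary median
low = generalized_median(messages, fixed=[0.0, 0.0])        # pulled to the lowest report
high = generalized_median(messages, fixed=[1.0, 1.0])       # pulled to the highest report
shuffled = generalized_median([0.9, 0.2, 0.5], fixed=[0.0, 0.0])
```

Placing the fixed numbers at the extremes selects other order statistics of the announcements, and permuting the announcements leaves the outcome unchanged, which is the anonymity property.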


2.2. Auctions. Because of the role of auctions in allocating everything
from broadcast spectra to tchotchkes on eBay, economists have
developed a large body of theory on how to structure auctions optimally,
both to generate maximal revenue and to achieve allocative efficiency.
While political scientists are generally not concerned with those issues,
we present some of the basics of auction theory for a couple of reasons.
First, several important aspects of politics can be thought of as particular
types of auctions. Second, auction theory demonstrates how
mechanism design can be used to study the choice of institutions.

However, it is not that auction theory has been ignored in political
science. A classic application of auction theory is the class of models of
favor-buying in which competing interest groups offer bribes to secure public
contracts or policy concessions from politicians. Recently one of us
has argued for modeling electoral competition as a form of an auction
(Meirowitz 2004).

The standard auction problem involves a seller (agent 0) and a
population of potential bidders, $N = \{1, \ldots, n\}$. Each bidder places
a valuation of $\theta_i \in \mathbb{R}_+$ on the item to be sold. These valuations
are the private information of each buyer. We assume for simplicity
that the common prior is that bidder valuations are independent and
identical draws from a twice differentiable distribution function $F(\cdot)$.
The utility to a bidder $i$ that has valuation $\theta_i$ and wins the item by
paying $b_i$ is just $\theta_i - b_i$. The payoff to losing bidders is 0. We begin
with a few commonly studied auction mechanisms.

2.2.1. Second Price and Ascending Price Auctions. Consider two
auction designs. In a second price auction, each participant submits a
sealed bid $b_i$ and the one who bids the highest wins the object. However,
the winner pays an amount equal to the second highest bid. In an
ascending price auction, all participants begin with their placards up,
and the auctioneer announces an ascending sequence of prices, \$10,
\$11, \$12, .... When a price is announced that causes the second to last
placard to fall, the item is sold to the bidder with the remaining upright placard
at the announced price.

Both of these auctions are commonly used and the experience of
bidding under these schemes may seem quite different. Nonetheless,
from a game-theoretic perspective the mechanisms are equivalent. Note
that the auctions induce different games (one is a Bayesian game with
simultaneous moves and the other is a Bayesian game with sequential
moves). Nonetheless, the incentives are identical. To see this imagine
that each bidder entered their valuation $\theta_i$ into a computer and the
computer did two things: (1) it submitted the valuation into a second
price auction, i.e. it used the strategy $b_i = \theta_i$; (2) it had a robot play the
following strategy in an ascending price auction: hold the placard up
until the announced price exceeds the valuation $\theta_i$. In both cases
the bidder with the highest valuation would win the item and pay the
second highest price.

It is easy to see that in the second price auction bidding $b_i = \theta_i$
is a dominant strategy. Consider any other bid $b_i \ne \theta_i$. Let $b^{\max} =
\max_{j \ne i} b_j$ denote the highest bid submitted by all the other agents.
There are three possibilities: either $b^{\max} = \theta_i$, $b^{\max} < \theta_i$, or $b^{\max} > \theta_i$.
In the first case use of the strategy $b_i = \theta_i$ will result in a tie (which
has expected utility equal to 0). The strategy $b_i > \theta_i$ will result in
victory but at the price $b^{\max} = \theta_i$, which results in utility of 0. Finally,
under the strategy $b_i < \theta_i$ you do not win the item and receive utility
of 0. In the second case, under the strategy of $b_i = \theta_i$ you win and
pay $b^{\max}$, receiving payoff $\theta_i - b^{\max} > 0$. Under the strategy $b_i > \theta_i$
you win and also pay $b^{\max}$, again receiving payoff $\theta_i - b^{\max}$. Under the
strategy of $b_i < \theta_i$ you either still win and pay $b^{\max}$ (if $b_i > b^{\max}$),
receiving the same payoff, or you do not win and receive a payoff of 0. In the last
case, under the strategy of $b_i = \theta_i$ you do not win and receive a payoff
of 0, and the same is true of any bid $b_i < \theta_i$. Under the strategy $b_i > \theta_i$
you either win and pay $b^{\max}$ for an
item that you only value at $\theta_i < b^{\max}$ and thus you receive a negative
payoff, or you do not win and receive a payoff of 0. Thus the strategy
of $b_i = \theta_i$ does at least as well as any other strategy and avoids two
types of regret: not winning an item at a price you would be willing to
pay and winning an item at a price that you do not want to pay.
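The case analysis can be confirmed by enumeration on a small grid. The sketch below assumes integer valuations and bids, two rival bidders, and that ties are split evenly among the top bidders; the tie rule and the function names are assumptions for illustration (a tie at a truthful bid yields zero either way, as in the text).

```python
from itertools import product

def second_price_payoff(value, own_bid, other_bids):
    """Payoff to one bidder in a second price auction.
    Ties are split evenly among the top bidders (an assumed tie rule)."""
    b_max = max(other_bids)
    if own_bid > b_max:
        return value - b_max                   # win and pay the second highest bid
    if own_bid == b_max:
        n_tied = 1 + sum(1 for b in other_bids if b == b_max)
        return (value - b_max) / n_tied        # expected payoff from the tie
    return 0.0                                 # lose

def truthful_is_weakly_dominant(value, grid, n_others):
    """True if no bid on the grid ever does strictly better than bidding value."""
    for others in product(grid, repeat=n_others):
        honest = second_price_payoff(value, value, others)
        if any(second_price_payoff(value, b, others) > honest for b in grid):
            return False
    return True

grid = [0, 1, 2, 3, 4, 5]     # hypothetical valuations and bids
dominant = all(truthful_is_weakly_dominant(v, grid, n_others=2) for v in grid)
```

The two kinds of regret show up directly: underbidding can only forfeit a profitable win, and overbidding can only add a win at a price above the bidder's valuation.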

Similarly, lowering the placard once the price exceeds $\theta_i$ is a dominant
strategy in the ascending price auction. Is there a reason to
lower the placard before the price reaches $\theta_i$? Doing so ensures that
you do not win the item, thus this deviation results in a payoff of 0.
Thus, early resignation cannot do better than the conjectured strategy
of lowering the placard at a price equal to $\theta_i$. Moreover, if it is the
case that all other agents will lower the placard by a price $p' < \theta_i$ then
the conjectured strategy would result in victory at a price of $p'$ and
utility $\theta_i - p'$. Now consider the potential consequences of deviating
to a strategy of keeping the placard up after the price exceeds $\theta_i$. In
this case either you are not the winner and you receive a utility of 0 or
you are the winner and you pay a price $p'' > \theta_i$ to win an object worth
$\theta_i$, resulting in a negative payoff.

We now consider different types of auctions. 

2.2.2. First Price and Descending Price Auctions*. In a first price 
auction, agents simultaneously submit bids and the highest bidder wins 
the object and pays her bid. In a descending price auction each bidder 
has a placard and the auctioneer begins with a high price and lowers 
it as the auction progresses $100, $99, $98,.... The bidder that raises 
her placard first wins the item and pays the most recent price. As 
before the simultaneous move and extensive form versions of the game 
are similar. We focus only on the analysis of the first price auction, 
leaving as an exercise the construction of a subgame perfect equilibrium 
to the descending price auction. We follow Krishna (2002) and present 
an informal derivation of the equilibrium strategies. We begin with a 
conjecture about equilibrium strategies and verify that the conjecture 
turns out to be consistent with a PBE. 
Suppose that bidders $j \neq i$ have strategies that are represented by 
the differentiable and strictly increasing function $b(\theta)$. Since a 
bidder's utility is 0 if she pays her valuation or loses, given the 
random variable $b^{\max}$ the expected utility to a bidder with type 
$\theta_i$ from using strategy $b_i$ is 

$$\Pr(b^{\max} < b_i)(\theta_i - b_i).$$

Since we have assumed that the other bidders use the strategy $b(\theta)$, 
the term $\Pr(b^{\max} < b_i)$ is just the probability that $n - 1$ draws 
of $\theta_j$ from $F(\cdot)$ each have values lower than $b^{-1}(b_i)$. Given 
the independence of types the expression for this probability is just 
$F(b^{-1}(b_i))^{n-1}$. Accordingly, the expected utility is 

$$F(b^{-1}(b_i))^{n-1}(\theta_i - b_i).$$
An optimal $b_i$ must solve the first order necessary condition 

$$(n-1)F(b^{-1}(b_i))^{n-2} f(b^{-1}(b_i)) \frac{db^{-1}(b_i)}{db_i}\,[\theta_i - b_i] - F(b^{-1}(b_i))^{n-1} = 0.$$

In the conjectured equilibrium $b(\theta_i) = b_i$, so that 
$b^{-1}(b_i) = \theta_i$ and $db^{-1}(b_i)/db_i = 1/b'(\theta_i)$. Dropping the 
subscript, we thus have 

$$(n-1)F(\theta)^{n-2} f(\theta)\,\theta = F(\theta)^{n-1}\frac{db(\theta)}{d\theta} + (n-1)F(\theta)^{n-2} f(\theta)\, b(\theta).$$

Noting that the right hand side is $\frac{d}{d\theta}\left(F(\theta)^{n-1} b(\theta)\right)$ 
allows us to re-express the above as 

$$\frac{d}{d\theta}\left(F(\theta)^{n-1} b(\theta)\right) = (n-1)F(\theta)^{n-2} f(\theta)\,\theta.$$

The fundamental theorem of calculus allows us to re-express this as 

$$b(\theta) = \frac{\int_0^{\theta} \theta'(n-1)F(\theta')^{n-2} f(\theta')\,d\theta'}{F(\theta)^{n-1}} \tag{11.1}$$

which is the conditional expectation of the highest of the other 
bidders' valuations given that it is below $\theta$. Verifying that this 
is in fact an equilibrium amounts to determining whether the local 
extremum characterized by equation 11.1 is in fact a global maximum. 
We leave this as an exercise. 

2.2.3. The Revenue Equivalence Principle*. We see that the first 
price and second price auctions result in different strategies. Given 
this, a natural question is which auction the seller would prefer to use. 
This question is answered by a fundamental result in auction theory, the 
revenue equivalence principle (Riley and Samuelson 1981; Myerson 
1981). This principle guarantees that when the types are independent 
the expected revenue of an auction depends only on the probability 
that a bidder wins as a function of her type $\theta_i$ and the utility 
realized by a bidder of the lowest possible type. An immediate 
consequence of this result is that, assuming independent valuations, 
all of the above auctions yield the same expected revenue to the 
seller. In this section we develop this logic. 
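
As a numerical sanity check on equation 11.1 and on the claim of equal revenue, the sketch below assumes that valuations are uniform on [0, 1], so that $F(x) = x$ and $f(x) = 1$. This distribution is an assumption made only for the illustration, and the function names are hypothetical. Under it, the bid function should reduce to $b(\theta) = (n-1)\theta/n$, and the expected revenue of the first price auction should match the expected second highest valuation, $(n-1)/(n+1)$.

```python
def first_price_bid(theta, n, steps=400):
    """Equation 11.1 with F(x) = x on [0, 1], integrated by the midpoint rule."""
    if theta <= 0:
        return 0.0
    width = theta / steps
    numerator = 0.0
    for k in range(steps):
        x = (k + 0.5) * width
        numerator += x * (n - 1) * x ** (n - 2) * width
    return numerator / theta ** (n - 1)


def revenue_first_price(n, steps=400):
    """Expected winning bid; the highest of n uniform draws has density n x^(n-1)."""
    width = 1.0 / steps
    total = 0.0
    for k in range(steps):
        x = (k + 0.5) * width
        total += first_price_bid(x, n, steps) * n * x ** (n - 1) * width
    return total


def revenue_second_price(n, steps=400):
    """Expected second highest of n uniform draws, density n (n-1) x^(n-2) (1-x)."""
    width = 1.0 / steps
    total = 0.0
    for k in range(steps):
        x = (k + 0.5) * width
        total += x * n * (n - 1) * x ** (n - 2) * (1 - x) * width
    return total


for n in (2, 3, 5):
    print(n, first_price_bid(0.8, n), revenue_first_price(n), revenue_second_price(n))
```

The two revenue columns agree up to integration error even though the bids themselves differ across the two formats.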

Returning to the notation from the beginning of this chapter, let 
$M = \Theta$ denote the space of possible messages. We assume that each of 
the $n$ players has a valuation which is an independent draw from the 
differentiable distribution $F(\cdot)$ on $\Theta$ with density $f(\cdot)$. For 
convenience we assume that $\Theta$ is an interval of the form $[0, k]$. By 
$\Delta(N)$ we denote the set of lotteries over the bidders $N$. Let 
$w(m_1, \ldots, m_n) : M^n \to \Delta(N)$ and $t(m_1, \ldots, m_n) : M^n \to \mathbb{R}^n$ 
denote a mechanism which specifies for each profile $m$ of messages a 
lottery over the identity of the winner and a profile of transfers. 
Thus, the first price auction has winner 

$$w_i(m) = \begin{cases} 1 & \text{if } i = \arg\max_j\{m_j\} \\ 0 & \text{otherwise} \end{cases}$$

and transfers 

$$t_i(m) = \begin{cases} m_i & \text{if } i = \arg\max_j\{m_j\} \\ 0 & \text{otherwise.} \end{cases}$$

An all-pay, first-price auction (which we analyze below) has 
$t_i(m) = m_i$. The subject of the revenue equivalence result is the 
expected revenue generated by Bayesian Nash play in the auction. 
Standard auctions have the "winner takes it with certainty" mapping 
$w(m)$ defined above. Given an increasing bid function $b(\theta_i)$ the 
expected revenue is $ER = \sum_{i=1}^{n} \int_0^k t_i(b(\theta_i))\,dF(\theta_i)$. 
We can now state the result. 


THEOREM 11.1. Assume valuations are independently and identi- 
cally distributed. In every standard auction, every symmetric and 
increasing equilibrium in which the expected payment of a bidder with 
value 0 is 0 yields the same value of ER. 


PROOF. We consider a fixed standard auction, and a symmetric and 
increasing equilibrium characterized by the function $b(\cdot)$. Let 
$t^e(\theta_i)$ denote the expected payment of a bidder with value $\theta_i$. 
Suppose that bidder $i$'s valuation is $\theta_i$ and consider the bid 
$z = b(\theta_i')$. When all other bidders use the strategy $b(\cdot)$ the 
expected utility of bid $z$ to bidder $i$ is 

$$\theta_i F(\theta_i')^{n-1} - t^e(\theta_i').$$

Differentiating this expected utility with respect to the type 
$\theta_i'$ yields the first order condition 

$$(n-1)\theta_i F(\theta_i')^{n-2} f(\theta_i') = \frac{dt^e(\theta_i')}{d\theta_i'}.$$

Since $b(\cdot)$ is an equilibrium it must be the case that 
$z = b(\theta_i') = b(\theta_i)$ solves this condition, yielding 

$$(n-1)\theta_i F(\theta_i)^{n-2} f(\theta_i) = \frac{dt^e(\theta_i)}{d\theta_i}.$$

But this differential equation yields the solution 

$$t^e(\theta_i) = t^e(0) + \int_0^{\theta_i} (n-1)\,x\,F(x)^{n-2} f(x)\,dx.$$

Thus under the assumption that $t^e(0) = 0$ we have 
$t^e(\theta_i) = \int_0^{\theta_i} (n-1)\,x\,F(x)^{n-2} f(x)\,dx$, which does not 
depend on the details of the auction. Since 

$$ER = \sum_{i=1}^{n} \int_0^k t^e(\theta_i)\,dF(\theta_i) = n\int_0^k t^e(\theta) f(\theta)\,d\theta,$$

this completes the proof. $\square$ 


2.2.4. Contests and All-Pay Auctions*. An all-pay auction requires 
that contestants pay their bids regardless of whether they win. Many 
contests take this form. Two examples of interest in political science 
are models of interest group influence, in which special interests pay 
bribes and the group that makes the largest payment receives a policy, 
and electoral campaigns, in which candidates compete for office by 
mounting costly campaigns. We develop this second application in 
some detail. We use the revenue equivalence principle to reach some 
interesting conclusions. 

Consider a pool of candidates, $C = \{1, 2, \ldots, c\}$. Each candidate 
has a non-negative real-valued type $\theta_p \in \mathbb{R}_+$ that corresponds to 
their overall efficiency at governing. The social goal is to select 
the most efficient candidate $p^* = \arg\max_{p \in C} \theta_p$. We assume that 
the quality types $\theta = (\theta_1, \ldots, \theta_c)$ are private information: only 
candidate $p$ knows $\theta_p$. For simplicity we assume that types are 
independent and identical draws from the distribution $F(\cdot)$ with 
density function $f(\cdot)$. Let $M_c = \max\{\theta_j\}_{j \neq p}$ denote the 
maximum type from $c - 1$ draws of $\theta$. Given $\theta_p$, the probability 
that $M_c < \theta_p$ is $F_c(\theta_p) = F(\theta_p)^{c-1}$. Differentiating yields 
the density of $M_c$, $f_c(\theta_p) = (c-1)F(\theta_p)^{c-2} f(\theta_p)$. 

Our baseline model of an election involves two periods. In the 
first period each candidate simultaneously selects a level of campaign 
effort a,. In the second period the candidate with the highest level of 
effort wins office. The key assumption is that a candidate’s cost of 
campaign effort is decreasing in her governing efficiency. That is, we 
assume that if candidate 1 has a lower cost of campaign effort than 
candidate 2, candidate 1 is likely to be a more effective leader than is 
candidate 2. For simplicity, we consider the case where campaigning 
costs are inversely proportional to efficiency, so that the cost of 
effort $a_p$ to candidate $p$ is $a_p/\theta_p$. Candidate payoffs are then 

$$Eu(a_p, \theta_p) = \begin{cases} 1 - \dfrac{a_p}{\theta_p} & \text{if } a_p > \max_{j \neq p}\{a_j\} \\[1ex] -\dfrac{a_p}{\theta_p} & \text{if } a_p < \max_{j \neq p}\{a_j\} \\[1ex] \dfrac{1}{\#\{j : a_j = a_p\}} - \dfrac{a_p}{\theta_p} & \text{if } a_p = \max_{j \neq p}\{a_j\} \end{cases} \tag{11.2}$$

where the last line of the expression follows from assuming that 
randomization is used in case of ties. Note that multiplying each 
candidate's utility function by $\theta_p$ translates this payoff function 
into that of a standard all-pay auction where the prize has value 
$\theta_p$ and bids have unitary cost.$^3$ We now characterize a symmetric 
pure strategy equilibrium in which a candidate's effort level is a 
strictly increasing and differentiable function of her efficiency type 
$\theta_p$. If each player is using a strictly increasing and differentiable 
strategy $a(\theta)$ then candidate $p$ with type $\theta_p$ that selects effort 
$a_p$ has expected utility 

$$\Pi(a_p, \theta_p) = F(a^{-1}(a_p))^{c-1} - \frac{a_p}{\theta_p}. \tag{11.3}$$

In order for $a_p$ to be optimal it must solve the first order condition 

$$(c-1)F(a^{-1}(a_p))^{c-2} f(a^{-1}(a_p)) \frac{da^{-1}}{da_p} = \frac{1}{\theta_p}. \tag{11.4}$$

In a symmetric equilibrium it must be the case that $a(\theta_p) = a_p$, so 
that $a^{-1}(a_p) = \theta_p$. Thus the condition reduces to 

$$\frac{da^{-1}}{da_p} = \frac{1}{\theta_p (c-1) F(\theta_p)^{c-2} f(\theta_p)}. \tag{11.5}$$

This means that 

$$\frac{da}{d\theta_p} = \theta_p (c-1) F(\theta_p)^{c-2} f(\theta_p). \tag{11.6}$$

Integration yields the required solution, 

$$a(\theta) = \int_0^{\theta} x (c-1) F(x)^{c-2} f(x)\,dx = \int_0^{\theta} x f_c(x)\,dx. \tag{11.7}$$

This function is strictly increasing in $\theta$ so it remains only to 
verify that this solution satisfies the sufficient second order 
condition. This result follows from Theorem 2 of Krishna and Morgan 
(1997) so we do not reproduce the proof. 

3 See for example Milgrom and Weber (1985) for results on standard 
auction models. 


PROPOSITION 11.3. In the basic model a symmetric equilibrium in 
which accumulations $a(\theta)$ are strictly increasing in efficiency exists. 
In this equilibrium the best candidate $p^* = \arg\max_{p \in C}(\theta_p)$ is 
chosen with probability one. 
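
To make the equilibrium effort function 11.7 concrete, the following sketch evaluates it numerically under the assumption that types are uniform on [0, 1], so that $F(x) = x$. The model itself allows any $F$ on the non-negative reals; the uniform case is chosen only for illustration and the function names are hypothetical. In that case the integral has the closed form $a(\theta) = \frac{c-1}{c}\theta^c$, which is strictly increasing, so the candidate with the highest type exerts the most effort and wins.

```python
def effort(theta, c, steps=2000):
    """Equation 11.7 with F(x) = x on [0, 1], integrated by the midpoint rule."""
    width = theta / steps
    total = 0.0
    for k in range(steps):
        x = (k + 0.5) * width
        total += x * (c - 1) * x ** (c - 2) * width
    return total


def winner(types):
    """Index of the candidate with the highest equilibrium effort."""
    efforts = [effort(t, len(types)) for t in types]
    return efforts.index(max(efforts))


types = [0.35, 0.9, 0.6]
print([round(effort(t, len(types)), 4) for t in types], winner(types))
```

Because effort is monotone in type, ranking candidates by effort is the same as ranking them by efficiency, which is the selection property stated in the proposition.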


A few comments are in order. First, $a(\theta_i) < \theta_i$. 


CONCLUSION 1. The winning candidate achieves a strictly positive 
payoff. 
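Second, as a function of $c$ the effort strategies have derivative 

$$\frac{\partial a(\theta)}{\partial c} = \int_0^{\theta} x\left[1 + (c-1)\ln(F(x))\right] F(x)^{c-2} f(x)\,dx,$$

which is negative whenever $F(\theta) \le e^{-1/(c-1)}$, since the bracketed 
term is then nonpositive over the whole range of integration. Because 
this threshold rises with $c$, for such types we have 
$a_c(\theta) > a_{c+1}(\theta)$, where $a_n(\theta)$ denotes the equilibrium effort 
function when $c = n$. 


CONCLUSION 2. For types that are not too close to the top of the 
distribution, candidate efforts are decreasing in the number of 
candidates. 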


Third, since the model is equivalent to a first-price all-pay auction 
with independent values and $a(0) = 0$, the revenue equivalence 
principle proven above implies that the expected total amount of effort 

$$A = c\int_0^{\infty} a(\theta) f(\theta)\,d\theta$$

must be the same as the expected payment of the winner in a second 
price auction in which player values are drawn from $f(\cdot)$. This value 
is just the expected value of the second highest of $c$ draws from 
$f(\cdot)$, which is 

$$\int_0^{\infty} c(c-1)\,x\,F(x)^{c-2}(1 - F(x)) f(x)\,dx,$$

which is increasing in $c$. 
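
The equality between expected total effort and the expected second highest type can be illustrated numerically. The sketch below again assumes uniform types on [0, 1] (an assumption of the example, not of the model; all function names are hypothetical), in which case both quantities equal $(c-1)/(c+1)$.

```python
def effort(theta, c, steps=300):
    """Equilibrium effort 11.7 with F(x) = x on [0, 1] (midpoint rule)."""
    width = theta / steps
    total = 0.0
    for k in range(steps):
        x = (k + 0.5) * width
        total += x * (c - 1) * x ** (c - 2) * width
    return total


def total_effort(c, steps=300):
    """c times the expected effort of a single candidate."""
    width = 1.0 / steps
    return c * sum(effort((k + 0.5) * width, c, steps) * width for k in range(steps))


def expected_second_highest(c, steps=300):
    """Mean of the second highest of c uniform draws."""
    width = 1.0 / steps
    total = 0.0
    for k in range(steps):
        x = (k + 0.5) * width
        total += c * (c - 1) * x * x ** (c - 2) * (1 - x) * width
    return total


for c in (2, 3, 4, 5):
    print(c, round(total_effort(c), 4), round(expected_second_highest(c), 4))
```

The printed pairs coincide and rise with $c$, which is the content of the third comment.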


CONCLUSION 3. The total effort is increasing in the number of can- 
didates 


It is reasonable to think that the social objective is maximization 
of $\theta_{p^*}$. If in addition the actual effort of campaigning is viewed 
as wasteful then it is natural to think that the social objective 
involves a trade-off between increasing $E\theta_{p^*}$ and decreasing 
$E\sum_{p=1}^{c} a_p$, where $E$ is the expectations operator. Letting 
$v(\cdot)$ and $u(\cdot)$ denote twice differentiable increasing functions with 
$v'' \le 0$ and $u'' \ge 0$, it is natural to consider a social welfare 
function of the form 

$$V = Ev(\theta_{p^*}) - Eu\left(\sum_{p=1}^{c} a_p\right).$$

While the all-pay auction aspect of campaigns does an excellent job 
selecting the best quality candidate, unlike a typical auction where the 
bidding is viewed as revenue to the seller, in the campaign context the 
costs $\sum_{p=1}^{c} a_p$ are not desirable. This perspective motivates a 
natural question. How does one select good candidates while not 
inducing excessive wasteful campaigning? In the next section we 
demonstrate that one answer to this question involves imperfect voting, 
a phenomenon that many believe is empirically reasonable. 

The baseline model assumes that elections are perfect screening 
devices selecting the candidate that selects the highest level of $a$. 
In reality voting is imperfect, and candidates likely recognize this. 
In this section we consider elections with a random decision. For 
simplicity assume that the candidate with the highest level $a$ wins 
with probability $q + \frac{1-q}{c}$ and that each of the remaining 
candidates wins office with probability $\frac{1-q}{c}$. To be sure, a 
large number of other models of probabilistic selection can be 
incorporated. In this setting the candidate payoffs are then 

$$Eu(a_p, \theta_p) = \begin{cases} \dfrac{1-q}{c} - \dfrac{a_p}{\theta_p} & \text{if } a_p < \max_{j \neq p}\{a_j\} \\[1ex] q + \dfrac{1-q}{c} - \dfrac{a_p}{\theta_p} & \text{if } a_p > \max_{j \neq p}\{a_j\} \\[1ex] \dfrac{q}{\#\{j : a_j = a_p\}} + \dfrac{1-q}{c} - \dfrac{a_p}{\theta_p} & \text{if } a_p = \max_{j \neq p}\{a_j\} \end{cases} \tag{11.8}$$

If each player is using a strictly increasing and differentiable strategy 
$a(\theta)$ then candidate $p$ with type $\theta_p$ that selects accumulation 
$a_p$ has expected utility 

$$\Pi(a_p, \theta_p, q) = qF(a^{-1}(a_p))^{c-1} + \frac{1-q}{c} - \frac{a_p}{\theta_p}. \tag{11.9}$$

In order for $a_p$ to be optimal it must solve the first order condition 

$$q(c-1)F(a^{-1}(a_p))^{c-2} f(a^{-1}(a_p)) \frac{da^{-1}}{da_p} = \frac{1}{\theta_p}. \tag{11.10}$$

In a symmetric equilibrium it must be the case that $a(\theta_p) = a_p$, so 
that $a^{-1}(a_p) = \theta_p$. Thus the condition reduces to 

$$\frac{da^{-1}}{da_p} = \frac{1}{\theta_p\, q(c-1) F(\theta_p)^{c-2} f(\theta_p)}. \tag{11.11}$$

This means that 

$$\frac{da}{d\theta_p} = \theta_p\, q(c-1) F(\theta_p)^{c-2} f(\theta_p). \tag{11.12}$$

Integration yields the required solution, 

$$a(\theta; q) = \int_0^{\theta} x\,q(c-1)F(x)^{c-2} f(x)\,dx \tag{11.13}$$

$$= q\int_0^{\theta} x f_c(x)\,dx = q\,a(\theta; 1). \tag{11.14}$$

Accordingly, when the election is imperfect, each candidate shades 
down her effort and the efforts are proportional to the imperfection 
parameter $q$. 

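For the remainder of this section we will focus on two candidate 
contests and consider the optimal level $q \in [0, 1]$. In this case the 
strategies are 

$$a(\theta; q) = q\int_0^{\theta} x f(x)\,dx.$$

The social welfare associated with $q$ is 

$$V(q) = q\int_0^{\infty} v(x)\,2F(x) f(x)\,dx + (1-q)\int_0^{\infty} v(x) f(x)\,dx - \int_0^{\infty}\!\!\int_0^{\infty} u\big(a(\theta_1; q) + a(\theta_2; q)\big) f(\theta_1) f(\theta_2)\,d\theta_1\,d\theta_2,$$

where the first two terms give the expected value of the winner's type: 
with probability $q$ the better candidate is selected and otherwise the 
winner is drawn at random. An interior optimal value of $q$ satisfies 
the first order condition 

$$\int_0^{\infty} v(x) F(x) f(x)\,dx - \int_0^{\infty} v(x)(1 - F(x)) f(x)\,dx = \int_0^{\infty}\!\!\int_0^{\infty} u'\big(q[a(\theta_1; 1) + a(\theta_2; 1)]\big)\,[a(\theta_1; 1) + a(\theta_2; 1)]\, f(\theta_1) f(\theta_2)\,d\theta_1\,d\theta_2.$$

The optimal value $q^*$ is less than 1 if 

$$\int_0^{\infty} v(x) F(x) f(x)\,dx - \int_0^{\infty} v(x)(1 - F(x)) f(x)\,dx < \int_0^{\infty}\!\!\int_0^{\infty} u'\big(a(\theta_1; 1) + a(\theta_2; 1)\big)\,[a(\theta_1; 1) + a(\theta_2; 1)]\, f(\theta_1) f(\theta_2)\,d\theta_1\,d\theta_2$$

and greater than 0 if 

$$\int_0^{\infty} v(x) F(x) f(x)\,dx - \int_0^{\infty} v(x)(1 - F(x)) f(x)\,dx > 2u'(0)\int_0^{\infty}\left(\int_0^{\theta} x f(x)\,dx\right) f(\theta)\,d\theta.$$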

Accordingly, if $v(\cdot)$ is relatively flat and $u(\cdot)$ is relatively steep 
then imperfect elections are efficiency enhancing. When $v$ and $u$ are 
linear the first order condition does not depend on $q$ and the optimal 
$q$ is either 1 or 0, as it is either efficient to maximize the 
probability that the best candidate serves or to minimize the 
campaigning costs, depending on which effect has a larger marginal 
impact on social welfare. 
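
A small numerical illustration may help fix ideas about the trade-off. The sketch below is built entirely on assumed functional forms that do not appear in the text: two candidates, types uniform on [0, 1], a linear benefit $v(x) = x$, and a convex cost $u(y) = k y^2$ with a hypothetical steepness parameter $k$. Under these assumptions $a(\theta; 1) = \theta^2/2$, the expected quality of the winner is $q \cdot \frac{2}{3} + (1-q) \cdot \frac{1}{2}$, and a steep enough cost function produces an interior optimum.

```python
STEPS = 2000
POINTS = [(k + 0.5) / STEPS for k in range(STEPS)]

# Moments of a(theta; 1) = theta^2 / 2 when theta is uniform on [0, 1].
M1 = sum(t * t / 2 for t in POINTS) / STEPS
M2 = sum((t * t / 2) ** 2 for t in POINTS) / STEPS


def welfare(q, k=1.0):
    """V(q) for v(x) = x and u(y) = k * y**2 (assumed forms)."""
    expected_quality = q * (2 / 3) + (1 - q) * (1 / 2)
    # E[(q a1 + q a2)^2] = q^2 (2 E[a^2] + 2 E[a]^2) for independent types.
    expected_cost = k * q ** 2 * (2 * M2 + 2 * M1 ** 2)
    return expected_quality - expected_cost


def best_q(k=1.0, grid=1000):
    """Grid search for the welfare-maximizing precision q in [0, 1]."""
    return max((i / grid for i in range(grid + 1)), key=lambda q: welfare(q, k))


print(best_q(1.0), best_q(0.2))
```

With $k = 1$ the maximizer is interior (analytically $15/28$ under these assumptions), while a flatter cost such as $k = 0.2$ pushes the optimum to the corner $q = 1$.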


3. Incentive Compatibility and Individual Rationality 


In the contribution and polling examples, we considered mechanisms 
for which truthfully reporting one's type was a best response. 
While the revelation principle tells us that focusing on direct 
mechanisms will not limit the choice functions that we can implement, 
it is silent about what types of choice functions are implementable. In 
the next two sections we consider this question. From the revelation 
principle, we know that we need only focus on direct mechanisms. 
Consequently, the question becomes "which types of mechanisms will 
induce agents to be truthful?" In a direct mechanism agents are asked 
to report their private information type, and then this information is 
used to make a decision. Incentive compatibility is the requirement 
that given the mechanism and the belief that all other agents are being 
truthful, agent $i$ prefers being truthful to lying. 

We begin with the general problem. Consider a problem with 
agents $N$, choice space $X$, type space $\Theta$, prior joint density function 
over types $f(\theta)$, and state-contingent utility functions $u_i(x, \theta)$ for 
each $i \in N$. A direct mechanism is a mapping $p(\theta) : \Theta \to X$. In order 
for there to be a Bayesian Nash equilibrium with truthful strategies to 
the game that the mechanism induces, the following incentive 
compatibility condition must be true: 


DEFINITION 11.4. (Incentive Compatibility condition) For every 
$i \in N$ and every $\theta_i \in \Theta_i$ 

$$\int_{\Theta_{-i}} u_i(p(\theta_i, \theta_{-i}), \theta_i) f_{-i}(\theta_{-i})\,d\theta_{-i} \ge \int_{\Theta_{-i}} u_i(p(\theta_i', \theta_{-i}), \theta_i) f_{-i}(\theta_{-i})\,d\theta_{-i} \tag{IC}$$

for every $\theta_i' \in \Theta_i$. 


Informally, the condition requires that truthful messages are a best 
response if everyone else is using truthful messages. We now focus on 
gaining additional leverage on the types of mechanisms $p(\cdot)$ that can 
satisfy this condition. The easiest way to ensure that the IC is satisfied 
is to make $\int_{\Theta_{-i}} u_i(p(m_i, \theta_{-i}), \theta_i) f_{-i}(\theta_{-i})\,d\theta_{-i}$ constant 
in $m_i$. 

Consider the following example. Let $\omega$ be a random state variable 
that affects the players' payoffs from various policies. Assume 
that $n \ge 3$ agents observe $\omega$ so that $\theta_i = \omega$ with probability one. 
The mechanism designer can implement a choice function $x(\omega)$ with the 
following simple type of mechanism, which maps a profile of messages 
into policies $x$: 

$$p(\theta) = \begin{cases} x(\omega') & \text{if } \#\{j \in N : \theta_j = \omega'\} \ge n - 1 \\ x(\omega^*) & \text{otherwise} \end{cases}$$

where $\#\{j \in N : \theta_j = \omega'\}$ denotes the number of individuals 
announcing $\theta_j = \omega'$ and $\omega^*$ is an arbitrary value of $\omega$. Since a 
single defection does not alter the policy choice, this mechanism 
satisfies incentive compatibility. 

The mechanism in the above example, however, does not lead to 
a strong incentive to be truthful. In the auction example incentive 
compatibility is satisfied by a second price auction as (ignoring ties), 

$$u_i(p(m_i, \theta_{-i}), \theta_i) = \begin{cases} \theta_i - \max_{j \neq i} \theta_j & \text{if } m_i > \max_{j \neq i} \theta_j \\ 0 & \text{otherwise} \end{cases}$$

is constant in $m_i$ if $m_i > \max_{j \neq i} \theta_j$ and constant in $m_i$ for 
$m_i < \max_{j \neq i} \theta_j$. The non-constant part of the function jumps 
from 0 to $\theta_i - \max_{j \neq i} \theta_j$, which is positive only if 
$\theta_i > \max_{j \neq i} \theta_j$. In contrast the first price auction 
is not incentive compatible. 
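
The contrast between the two auctions can be seen in a small computation. The sketch below assumes a single truthful rival whose value is uniform on [0, 1], and it treats the first price auction as a direct mechanism in which the winner pays her own report; both are assumptions of this illustration, and the function names are hypothetical. It searches for the report that maximizes expected utility given the true value.

```python
def expected_utility(report, theta, pricing, steps=1000):
    """Expected utility of sending `report` when the true value is `theta`.

    The single rival reports truthfully and has a value uniform on [0, 1].
    `pricing` is "second" (winner pays the rival's report) or "first"
    (winner pays her own report).
    """
    total = 0.0
    for k in range(steps):
        rival = (k + 0.5) / steps
        if report > rival:  # the reporter wins the object
            price = rival if pricing == "second" else report
            total += (theta - price) / steps
    return total


def best_report(theta, pricing, grid=100):
    """Report on a grid that maximizes expected utility."""
    reports = [i / grid for i in range(grid + 1)]
    return max(reports, key=lambda m: expected_utility(m, theta, pricing))


print(best_report(0.6, "second"), best_report(0.6, "first"))
```

Under second price rules the best report coincides with the true value, whereas under first price rules the agent gains by shading her report, which is why truthful reporting fails there.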

To generate some intuition for incentive compatibility conditions 
we return to the coffee machine problem. Consider an arbitrary transfer 
schedule $t_i(m_i, m_{-i})$ which maps a message profile into an amount 
charged to member $i$. Let $p(m)$ be a policy function which maps the 
message profile into the probability that the coffee maker is purchased. 
Given this transfer schedule, the expected utility to department 
member $i$ from announcement $m_i$ is 

$$Eu(m_i, \theta_i) = \int_{\Theta_{-i}} \left[\theta_i\, p(m_i, \theta_{-i}) - t_i(m_i, \theta_{-i})\right] f_{-i}(\theta_{-i})\,d\theta_{-i}.$$

If we assume that $p$ and $t$ are differentiable functions, incentive 
compatibility requires that $m_i = \theta_i$ maximize $Eu(m_i, \theta_i)$. Therefore, 
we require a local incentive compatibility condition based on the 
first-order condition for maximization: 

$$\left.\frac{\partial Eu(m_i, \theta_i)}{\partial m_i}\right|_{m_i = \theta_i} = 0.$$

Interchanging the order of integration and differentiation in our 
example leads to the condition 

$$\theta_i \left.\int_{\Theta_{-i}} \frac{\partial p(m_i, \theta_{-i})}{\partial m_i} f_{-i}(\theta_{-i})\,d\theta_{-i}\right|_{m_i = \theta_i} = \left.\int_{\Theta_{-i}} \frac{\partial t_i(m_i, \theta_{-i})}{\partial m_i} f_{-i}(\theta_{-i})\,d\theta_{-i}\right|_{m_i = \theta_i}.$$

Intuitively, this means that an incentive compatible mechanism requires 
that the expected decrease in transfers associated with a slight 
underreporting of $\theta_i$ is exactly offset by the reduction in the 
expected likelihood of coffee maker purchase, weighted by the value of 
the purchase $\theta_i$. 

While incentive compatibility requires that players be willing to 
reveal their private information, we also require that participants be 
willing to play the game. Analysis of these constraints is somewhat 
simpler and ad hoc. The key intuition is that a player will rationally 
participate only if she gets a higher payoff in equilibrium than her 
payoff from not participating. In some settings, these constraints can 
be trivial as it is reasonable to assume that agents have no choice but 
to participate. In settings where the constraints are taken seriously, 
it is necessary to be explicit about the value to players of not 
participating. Formally, 


DEFINITION 11.5. (Individual Rationality constraint) If the value to 
a player from not participating in the mechanism is given by 
$v_i(\theta_i, \theta_{-i})$ and the value to being truthful in a direct mechanism 
is $u_i(\theta_i, \theta_{-i})$ then 

$$\int_{\Theta_{-i}} u_i(\theta_i, \theta_{-i}) f_{-i}(\theta_{-i})\,d\theta_{-i} \ge \int_{\Theta_{-i}} v_i(\theta_i, \theta_{-i}) f_{-i}(\theta_{-i})\,d\theta_{-i}$$

for all $\theta_i \in \Theta_i$ for each $i \in N$. 


4. Constrained Mechanism Design 


Classical mechanism design allows the planner to commit to one of a 
large number of mechanisms. Incentive compatibility, and potentially 
individual rationality are the only constraints. In many settings of 
political science the ability to commit is considered unreasonable. As 
such the choice of mechanisms is somewhat limited. Restrictions include 
an inability to provide transfers and limits on the potential 
commitments that can be made. The remainder of this section develops 
some examples that illustrate how political scientists use constrained 
mechanism design to address problems of institutional design. With 
few exceptions, the models in political science are within the 
principal-agent paradigm. This means nothing more than the recognition that 
there is a principal or boss (or several) and an agent or subordinate (or 
several). The principal would like to have “good” agents do “good” 
jobs and the agents tend to have a desire to keep the principal in the 
dark about whether they are of the “good” type or doing a “good” 
job. The principal is generally assumed to have a limited number of 
possible instruments which he can control. In the language of mech- 
anism design, the principal is the planner and doing a “good” job or 
revealing whether one is a “good” type corresponds to selecting appro- 
priate messages in the context of a direct mechanism. The limited 
number of levers corresponds to constraints on the mechanism that 
can be enacted. Before turning to some interesting applications, we 
demonstrate these concepts in a simple delegation problem. 

Suppose there is a principal and two agents. Agents have one of 
two possible types $\theta_i \in \{good, bad\}$. Each agent is a good type with 
probability $\pi$. In addition if an agent is chosen to perform a task she 
can devote one of two levels of effort $a_i \in \{high, low\}$. We suppose 
that $a_i = high$ imposes a cost of $c$ on the agent while $a_i = low$ is 
costless. 
The principal must select one of the two agents to perform a task and 
the chosen agent must decide which effort level to choose. The funda- 
mental question of principal agent models (or agency models) can 
then be phrased as follows. How can the principal design institutions 
to select a good type agent (if one exists) and induce the chosen agent 
to select high? The former aspect, designing institutions to select good 
types is often termed adverse selection. The latter aspect, design- 
ing institutions to create incentives for high effort is termed moral 
hazard. 

If the principal could observe a label on the agent’s shirt that in- 
dicated his type, or if the choice of effort were readily observable then 
we would say there is no monitoring or observability problem. In this 
case, things are not too challenging for our principal. She would simply 
hire only good types and fire them when they give low effort. More in- 
teresting are situations where agent types are private information and 
effort is imperfectly observed. Ferejohn (1986) models accountability 
in repeated elections as a principal agent problem. To clarify these 
concepts, we sketch and work through a simplified version of Ferejohn’s 
model. 

Ferejohn focused only on the moral hazard problem, so for now we 
will ignore the different types. There are two identical parties that can 
serve in office. Suppose that the party in government selects an effort 
level $a$, but the voters observe only a noisy policy outcome 
$x \in \{high, low\}$ that has the following form. If $a = high$ then $x$ is 
high with probability $q > \frac{1}{2}$ and low with probability $1 - q$; if 
$a = low$ then $x$ is low with probability $q$ and high with probability 
$1 - q$. Finally, suppose the government party receives a rent $r$ from 
being in office each period and discounts the future with discount 
factor $\delta$. The opposition party gets a payoff of zero for each period 
it is out of power. 

The voter gets to select a reelection rule specifying which values of 
x will result in the decision to reelect the incumbent and which values 
of x will result in removal. Can she select such a rule that creates 
incentives for the government to always select a = high? One simple 
rule would be to retain if x = high and replace otherwise. If the voter 
uses this rule in every period, then the government party faces a simple 
decision problem. If it selects $a = high$ in the current period then its 
expected utility is 

$$r - c + q\delta V_I + (1 - q)\delta V_O \tag{11.15}$$

where $V_I$ is the value of the game starting next period if the party is 
in office at the beginning of next period, while $V_O$ is the value of the 
game starting next period if the party is out of office at the beginning 
of next period.$^4$ If the government selects $a = low$ then its expected 
utility is 

$$r + (1 - q)\delta V_I + q\delta V_O. \tag{11.16}$$

If the voter's rule works so that the incumbent finds it best to select 
$a = high$ in every period, then we also have 

$$V_I = r - c + q\delta V_I + (1 - q)\delta V_O$$

and 

$$V_O = (1 - q)\delta V_I + q\delta V_O$$

which imply that 

$$V_I = \frac{r - c + cq\delta - qr\delta}{2q\delta^2 - \delta^2 - 2q\delta + 1} \tag{11.17}$$

and 

$$V_O = \frac{(1 - q)\delta(r - c)}{2q\delta^2 - \delta^2 - 2q\delta + 1}. \tag{11.18}$$
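
The closed forms 11.17 and 11.18 can be cross-checked by solving the two value equations directly. The following sketch uses hypothetical parameter values and function names; it compares the closed forms with the result of repeated substitution and then compares expressions 11.15 and 11.16 to see whether high effort is the better choice for the incumbent.

```python
def value_functions(r, c, q, delta):
    """Closed forms 11.17 and 11.18 under the retain-if-high rule."""
    denom = 2 * q * delta ** 2 - delta ** 2 - 2 * q * delta + 1
    v_in = (r - c + c * q * delta - q * r * delta) / denom
    v_out = (1 - q) * delta * (r - c) / denom
    return v_in, v_out


def iterate_values(r, c, q, delta, rounds=2000):
    """Solve the same two value equations by repeated substitution."""
    v_in, v_out = 0.0, 0.0
    for _ in range(rounds):
        v_in, v_out = (r - c + q * delta * v_in + (1 - q) * delta * v_out,
                       (1 - q) * delta * v_in + q * delta * v_out)
    return v_in, v_out


def high_effort_is_optimal(r, c, q, delta):
    """Compare expression 11.15 (high effort) with 11.16 (low effort)."""
    v_in, v_out = value_functions(r, c, q, delta)
    high = r - c + q * delta * v_in + (1 - q) * delta * v_out
    low = r + (1 - q) * delta * v_in + q * delta * v_out
    return high >= low


print(value_functions(10, 1, 0.8, 0.9), iterate_values(10, 1, 0.8, 0.9))
print(high_effort_is_optimal(10, 1, 0.8, 0.9), high_effort_is_optimal(10, 9.5, 0.8, 0.9))
```

When the cost of effort is small relative to the rent the incumbent prefers high effort; when effort is nearly as costly as the rent is valuable, the same rule no longer deters shirking.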


4 See Chapter 3 for a discussion of Bellman equations and value functions. 
A version of incentive compatibility (for all governments to select 
$a = high$) requires that expression 11.15 is no less than expression 
11.16. We can write this incentive constraint as 

$$V_I - V_O \ge \frac{c}{(2q - 1)\delta}. \tag{11.19}$$

Substituting equations 11.17 and 11.18 leads to the condition 

$$r - c - r\delta + c\delta \ge \frac{c(2q\delta^2 - \delta^2 - 2q\delta + 1)}{(2q - 1)\delta} \tag{11.20}$$

which is necessary in order for the voter's rule to induce $a = high$ 
in every period. In other words, if the exogenous parameters satisfy 
this inequality the moral hazard problem is solvable with the rule that 
retains only if $x = high$. In this setting the voters do not really need 
to commit to the rule because the rule represents an equilibrium 
strategy given the decision rule that the government officers are using, 
as long as the voter prefers $x = high$ to $x = low$. This is true because 
all governments will select $a = high$ and all governments have the 
same type (i.e. there is no adverse selection problem). Accordingly, 
if the above condition is satisfied then we have characterized a Nash 
equilibrium to the game between the two parties and the voter: if in 
office select $a = high$, and when deciding how to vote, retain only if 
$x = high$. 

Now we consider an adverse selection version of the problem. Suppose 
the government does not select an effort level but its type is 
private information. For simplicity, we assume that each of the parties 
has a type that is high with probability $\pi > \frac{1}{2}$. We assume these 
draws are independent. In each period the voter only observes $x$, which 
takes the value high with probability $z > \frac{1}{2}$ if the government is a 
high type and the value low with probability $1 - z$ if the government is 
a high type. If the government is a low type then $x = high$ with 
probability $1 - z$ and low with probability $z$. Once again the voter 
might try the simple rule: retain if $x = high$ and replace otherwise. 
However, this rule runs the risk of throwing out quality governments 
who are temporarily unlucky. A better rule uses information from 
previous periods. For any finite number of periods $k_i$ in which party 
$i$ was in office let $h_i$ denote the number of these periods in which $x$ 
was high. It follows that in $k_i - h_i$ of those periods we had 
$x = low$. The posterior probability that party $i$ is of type high after 
$h_i$ realizations of high from $k_i$ trials is given by Bayes' rule as 

$$\Pr(\theta_i = high \mid k_i, h_i) = \frac{\pi z^{h_i}(1 - z)^{k_i - h_i}}{\pi z^{h_i}(1 - z)^{k_i - h_i} + (1 - \pi)(1 - z)^{h_i} z^{k_i - h_i}}.$$

An optimal rule is to keep the original incumbent until 
$\Pr(\theta_i = high \mid k_i, h_i)$ falls below $\pi$, the expected quality of 
party $j$ before it serves. Then the voter should dump party $j$ as soon 
as $\Pr(\theta_i = high \mid k_i, h_i) > \Pr(\theta_j = high \mid k_j, h_j)$. Recalling 
the law of large numbers we see that this rule will eventually select 
the optimal party (they may both be optimal). 
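
The posterior in the Bayes' rule expression is easy to compute directly. The sketch below is a minimal illustration with hypothetical parameter values and a hypothetical function name; it shows how the voter's belief about the incumbent moves above or below the prior $\pi$ as the record of high and low outcomes accumulates.

```python
def posterior_high(prior, z, k, h):
    """Posterior probability of a high type after h high outcomes in k periods."""
    like_high = z ** h * (1 - z) ** (k - h)      # likelihood if the type is high
    like_low = (1 - z) ** h * z ** (k - h)       # likelihood if the type is low
    return prior * like_high / (prior * like_high + (1 - prior) * like_low)


prior, z = 0.6, 0.7
for k, h in [(0, 0), (4, 3), (4, 2), (4, 1)]:
    print(k, h, round(posterior_high(prior, z, k, h), 4))
```

A balanced record leaves the belief at the prior, so under the rule described above the incumbent is replaced only once low outcomes outnumber high ones.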

This discussion of agency has been meant only as an introduction to 
the concepts. We now consider several applications of agency theory 
to the study of delegation. The basic concepts of trying to create 
incentives to limit moral hazard and trying to select the best agents 
are recurrent. 


4.1. A Model of Delegation to Bureaucrats. One of the key 
questions in the study of the modern administrative state is the trade- 
off between political control of an agency and the autonomy that an 
agency requires to apply its expertise to policy problems. Principal 
agent theory has been a natural approach to this question. Originally 
applied in politics to study when and how the U.S. Congress delegated 
rule-making authority to regulatory agencies, principal agent 
applications have spread to many other political systems and to many 
other types of political bodies such as political parties and 
international organizations. In this section, we consider a version of 
the model that Epstein and O’Halloran (1994) applied to the study of 
statutory delegation in the United States. 

Suppose that a legislature $L$ is considering how much authority to 
delegate to a bureaucratic agency $A$. We assume that the policy space 
$X$ is single dimensional and a subset of $\mathbb{R}$. Each of the $n$ legislators 
has quadratic policy preferences with ideal points $l_1 < \ldots < l_n$. The 
agency is treated as a unitary actor with quadratic policy preferences 
and an ideal point $a$. 

To motivate why $L$ would delegate policy making authority to $A$ 
rather than set policy itself, we assume that the members of $L$ are 
uninformed about the consequences of various policy choices $p \in X$ 
whereas $A$ is fully informed. We model this uncertainty just as Gilligan 
and Krehbiel do in their study of legislative committees. We assume that 
the policy outcome $x$ is a function of the policy choice $p$ and an error 
term $\varepsilon$. To keep things as simple as possible, let $x = p - \varepsilon$. Each 
legislator has a common knowledge prior that $\varepsilon$ is mean zero and is 
distributed according to a distribution function $F(\varepsilon)$ with density 
$f(\varepsilon)$. The agency knows $\varepsilon$ with certainty. Thus, this informational 
structure captures the idea that bureaucrats are policy experts on whom 
legislators rely to improve policy formulation. 
Rather than solve for the optimal mechanism from $L$'s perspective, 
we limit the analysis to a very simple mechanism where $L$ chooses 
a set of allowable policies $P \subset X$. We assume that the sanctions 
against an agency that chooses $p \notin P$ are so large that it never 
chooses to do so. Therefore, given a set of allowable policies $P$ and 
the state of the world $\varepsilon$, $A$ will choose $p$ to maximize 

$$-(a - p + \varepsilon)^2 \quad \text{subject to } p \in P.$$

Thus, we can see that whenever $a + \varepsilon \in P$, the agency can get her 
ideal outcome by choosing $p = a + \varepsilon$. When $a + \varepsilon \notin P$, $A$ will 
choose the point in $P$ closest to $a + \varepsilon$. 

Given $A$'s best response, we turn to the legislature's choice of $P$. 
While in principle we can allow $P$ to be any subset of $X$, the following 
result establishes that $P$ will always be a closed interval 
$[\underline{p}, \overline{p}] \subset X$. 


PROPOSITION 11.4. $P^*$ will always be a closed interval 
$[\underline{p}, \overline{p}] \subset X$. 


We leave the details of the proof to the reader, but here's a hint. 
Suppose that $P$ has a "hole" in it, e.g. 
$P = [\underline{p}, p'] \cup [p'', \overline{p}]$ where $p' < p''$. Then whenever 
$p' < a + \varepsilon < \frac{p' + p''}{2}$, $A$ chooses $p'$, and when 
$\frac{p' + p''}{2} < a + \varepsilon < p''$, she chooses $p''$. Thus, the policy 
outcome as a function of $\varepsilon$ will appear as the solid line of Figure 
11.1. It is easy to see the variance in the policy outcome can be 
lowered by moving $p'$ and $p''$ closer together. Since all legislators 
are risk-averse, they would want to reduce the variance so long as the 
expected policy outcome does not change. The hint is over. It is now up 
to the reader to show that it is always possible to move $p'$ and $p''$ 
closer together without changing the mean policy outcome.$^5$ 


Insert Figure 11.1 Here 


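Given the proposition, it is clear that the legislature’s collective
choice problem is to choose $\underline{p}$ and $\overline{p}$, and the
agency’s best response function is

$$p^*(\varepsilon) = \begin{cases}
\underline{p} & \text{if } a + \varepsilon < \underline{p} \\
\overline{p} & \text{if } a + \varepsilon > \overline{p} \\
a + \varepsilon & \text{otherwise}
\end{cases}$$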


The solid line of Figure 11.1 gives this best response. 

⁵ Of course, eliminating holes does not prove that the interval must be
closed. Closedness is required to make A’s best response well-defined.

A complication arises in modelling L’s collective choice in that
it must decide over two dimensions, the lower bound and the upper
bound. Fortunately, the reader can check that the Plott conditions
will hold so that the majority rule decision will be the $P^*$ preferred
by the legislator with the median ideal point $l_m$. To get started, note
that given A’s best response, legislator $i$ with ideal point $l_i$
prefers the combination of $\underline{p}$ and $\overline{p}$ that
maximizes


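$$-\int_{\overline{p}-a}^{\infty} (l_i - \overline{p} + \varepsilon)^2 f(\varepsilon)\,d\varepsilon - \int_{\underline{p}-a}^{\overline{p}-a} (l_i - a)^2 f(\varepsilon)\,d\varepsilon - \int_{-\infty}^{\underline{p}-a} (l_i - \underline{p} + \varepsilon)^2 f(\varepsilon)\,d\varepsilon$$

The first order conditions are

$$\int_{\overline{p}-a}^{\infty} (l_i - \overline{p} + \varepsilon) f(\varepsilon)\,d\varepsilon = 0$$

$$\int_{-\infty}^{\underline{p}-a} (l_i - \underline{p} + \varepsilon) f(\varepsilon)\,d\varepsilon = 0$$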


Note that whenever $l_i < a$, the derivative of legislator $i$’s expected
utility with respect to $\underline{p}$ is negative for any finite
$\underline{p}$. Thus, the optimal choice is $\underline{p}^* = -\infty$.
The intuition is that when $l_i < a$ the agency always wants a higher
policy than legislator $i$ would want if she were informed. Thus,
legislator $i$ never finds it in her interest to constrain A from choosing
low policies. Similarly, if $l_i > a$, the derivative with respect to
$\overline{p}$ is positive and $\overline{p}^* = \infty$.


PROPOSITION 11.5. The majority rule outcome for $P^*$ is the closed
interval $[\underline{p}, \overline{p}]$ preferred by the legislator with
ideal point $l_m$.


Since the majority rule outcome is the median’s ideal statute, the
delegation game becomes one between the agency and legislative median.
Thus, the allowable policies for the agency are given by the solutions:

$$\underline{p}^* = -\infty, \qquad \int_{\overline{p}^*-a}^{\infty} (l_m - \overline{p}^* + \varepsilon) f(\varepsilon)\,d\varepsilon = 0$$

if $l_m < a$ and

$$\overline{p}^* = \infty, \qquad \int_{-\infty}^{\underline{p}^*-a} (l_m - \underline{p}^* + \varepsilon) f(\varepsilon)\,d\varepsilon = 0$$

if $l_m > a$.

Having already discussed why the median legislator will not want to
constrain the agency on one end of the spectrum, consider the intuition
for the expressions for the other constraint. Consider the case where
$l_m < a$. We can re-write the expression for $\overline{p}^*$ as

$$l_m = \int_{\overline{p}^*-a}^{\infty} (\overline{p}^* - \varepsilon)\, \frac{f(\varepsilon)}{1 - F(\overline{p}^*-a)}\,d\varepsilon$$

This condition implies that the expected outcome conditional on A
being constrained has to be at the median legislator’s ideal point. If
this expected outcome were greater than $l_m$, the median could do better
in expectation by further constraining A’s choice to generate lower
policies. Similarly, we can write the condition for $\underline{p}^*$ when
$l_m > a$ as

$$l_m = \int_{-\infty}^{\underline{p}^*-a} (\underline{p}^* - \varepsilon)\, \frac{f(\varepsilon)}{F(\underline{p}^*-a)}\,d\varepsilon$$


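To generate some more specific results, assume that $\varepsilon$ is
distributed uniformly on $[-E, E]$ so that
$F(\varepsilon) = \frac{\varepsilon + E}{2E}$ and
$f(\varepsilon) = \frac{1}{2E}$. Then if $l_m < a$,

$$l_m - \overline{p} + \frac{\overline{p} - a + E}{2} = 0,$$

or

$$\overline{p} = 2 l_m - a + E.$$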
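
The closed form can be checked numerically. The sketch below assumes the uniform case with $l_m < a$ and evaluates the first order condition for the upper bound with a simple midpoint rule; the function names and parameter values are illustrative choices, not part of the original model.

```python
def upper_bound_uniform(l_m: float, a: float, E: float) -> float:
    """Closed-form upper bound 2*l_m - a + E for the uniform case with l_m < a."""
    return 2.0 * l_m - a + E


def foc_upper(p_bar: float, l_m: float, a: float, E: float, n: int = 10_000) -> float:
    """Midpoint-rule value of the integral of (l_m - p_bar + eps) / (2E)
    over eps in [p_bar - a, E], i.e. the first order condition for p_bar."""
    lo, hi = p_bar - a, E
    h = (hi - lo) / n
    total = 0.0
    for k in range(n):
        eps = lo + (k + 0.5) * h
        total += (l_m - p_bar + eps) / (2.0 * E)
    return total * h


# Illustrative parameter values (assumed): median at 0, agency at 0.5, E = 1.
p_bar = upper_bound_uniform(l_m=0.0, a=0.5, E=1.0)
residual = foc_upper(p_bar, l_m=0.0, a=0.5, E=1.0)  # should be numerically zero
```

Raising `l_m` toward `a`, or increasing `E`, returns a larger upper bound.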


We can see that if the agency and the median get closer together (e.g.
by raising $l_m$ or decreasing $a$) then the legislature will pass a more
permissive statute with a greater upper bound. Intuitively, the
legislature will delegate more authority to an agency that shares its
preferences. Also, when there is more policy uncertainty (e.g. $E$ is
larger), the agency is granted more authority. Thus, when information
asymmetries are greater, the legislature will be more dependent on the
informed agency to formulate policy.⁶


⁶ When $l_m > a$, the solution is $\underline{p}^* = 2 l_m - a - E$. The
reader can verify that this lower bound is less restrictive when $l_m$ and
$a$ are close together and when $E$ is large.


4.2. Bureaucratic Capacity. One of the important assumptions
of the Epstein and O’Halloran and most other models of delegation is
that the agency can implement its policy choice perfectly without error.
This may be a reasonable assumption for advanced democracies with
cadres of professional, highly trained bureaucrats, but it is far less
applicable in many developing states and in earlier historical eras.

To address this issue, Huber and McCarty (2004) develop a model
in which bureaucracies vary in their capacity to implement policies. In
that model, if A attempts to implement policy $p$, the resulting policy
is $\hat{p} = p - \omega$ where $\omega$ is an implementation error which
is assumed to have mean 0 and variance $\sigma^2_\omega$. Bureaucracies
with high capacity are assumed to be better able to implement policies
and therefore have lower values of $\sigma^2_\omega$. Conversely, low
capacity bureaucracies implement policy with imprecision so that
$\sigma^2_\omega$ is high. Let $G(\omega)$ be the distribution function
for $\omega$ and $g(\omega)$ be the associated density.

Huber and McCarty embed this model of capacity into a delegation
model very similar to that of Epstein and O’Halloran. The legislature
would like to delegate to the agency because the agency is better
informed about the consequences of various policy choices. As above,
the agency knows $\varepsilon$ but the legislature knows only that it is
distributed uniformly on $[-E, E]$. They also assume that the members of
L and A have quadratic preferences over policy space $X$.

The legislature moves first and creates a statute specifying the set of
admissible policies $P = [\underline{p}, \overline{p}]$. The agency then
moves and attempts to implement policy $p$ which results in policy
$\hat{p} = p - \omega$ and outcome $x = \hat{p} - \varepsilon$. Huber and
McCarty assume that compliance with the statute requires that
$\hat{p} \in P$. Thus, even if the agency attempted to comply, i.e.
$p \in P$, its implementation errors might lead to non-compliance. If
$\hat{p} \notin P$, the agency incurs a cost $\delta$ as a sanction for
non-compliance.⁸ Unlike the Epstein and O’Halloran model, the agency may
actually choose to be non-compliant. Alternatively, non-compliance could
be purely a result of implementation errors. Since it is assumed that L
cannot observe $p$, the sanction for non-compliance must be the same
regardless of the ultimate cause. Thus, A will be sanctioned when
$p - \omega > \overline{p}$ or when $p - \omega < \underline{p}$. Given a
choice of $p$, the probability of sanction is
$G(p - \overline{p}) + 1 - G(p - \underline{p})$.

To facilitate the exposition, note that a number of features of the
Epstein-O’Halloran model generalize to this model. First, it can be
shown that the majority rule choice of $P$ maximizes the utility of the
legislator with ideal point $l_m$. We will assume throughout this section
that $a > l_m$. Secondly, we can show in the current model that
$\underline{p}^* = -\infty$.

⁷ A note to the industrious reader who refers to the original publication
is in order. In Huber and McCarty, the political principal is a generic
politician rather than a legislature. We have adjusted the nomenclature
and notation to parallel that of our discussion of Epstein and O’Halloran.

⁸ Actually, Huber and McCarty assume that non-compliance is detected
probabilistically. However, our simplification doesn’t alter the results.

Given this setup and the assumption that $a > l_m$, we can write A’s
utility function as⁹

$$-\int_{-\infty}^{\infty} (a - p + \omega + \varepsilon)^2 g(\omega)\,d\omega - \delta\,G(p - \overline{p})$$

$$= -(a - p + \varepsilon)^2 - \sigma^2_\omega - \delta\,G(p - \overline{p})$$

Her first order condition for a maximum is

$$2(a - p + \varepsilon) - \delta\,g(p - \overline{p}) = 0$$

The first term in this expression represents the marginal benefit of
moving the expected policy closer to the agency’s ideal point $a$. The
second term represents the net marginal cost of increasing the intended
policy in terms of the probability of sanction. Increasing $p$ increases
the probability of $p - \omega > \overline{p}$ by $g(p - \overline{p})$.
Clearly, A will choose $p$ to equate the marginal policy benefits with
marginal sanction costs. To get an explicit solution for $p^*$, Huber and
McCarty assume that $g(\omega) = \frac{\Omega - |\omega|}{\Omega^2}$.
This density is “tent-shaped” on the interval $[-\Omega, \Omega]$ and
implies that $\sigma^2_\omega = \frac{\Omega^2}{6}$. Thus, $\Omega$
represents a measure of bureaucratic incapacity.
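
As a quick sanity check on the tent-shaped density, the following sketch (with hypothetical function names and an arbitrary value of the incapacity parameter) confirms numerically that it integrates to one and has variance equal to the incapacity parameter squared over six. It also computes the sanction probability for the case in which only the upper bound binds.

```python
def tent_density(w: float, omega: float) -> float:
    """Tent-shaped density on [-omega, omega] with peak 1/omega at zero."""
    return max(omega - abs(w), 0.0) / omega**2


def tent_cdf(w: float, omega: float) -> float:
    """Distribution function of the tent density."""
    if w <= -omega:
        return 0.0
    if w >= omega:
        return 1.0
    if w <= 0.0:
        return (omega + w) ** 2 / (2.0 * omega**2)
    return 1.0 - (omega - w) ** 2 / (2.0 * omega**2)


def sanction_probability(p: float, p_bar: float, omega: float) -> float:
    """Probability that p - w exceeds p_bar when there is no lower bound."""
    return tent_cdf(p - p_bar, omega)


def moment(power: int, omega: float, n: int = 20_000) -> float:
    """Midpoint-rule approximation of the integral of w**power times the density."""
    h = 2.0 * omega / n
    total = 0.0
    for k in range(n):
        w = -omega + (k + 0.5) * h
        total += (w**power) * tent_density(w, omega)
    return total * h


omega = 2.0                    # illustrative incapacity level (assumed)
total_mass = moment(0, omega)  # close to 1
variance = moment(2, omega)    # close to omega**2 / 6
```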

Given these assumptions about functional form, Figure 11.2 plots
the marginal policy benefit curve and the marginal compliance costs for
three values of $\overline{p}$ as a function of $p$. The marginal benefit
line is the bold downward sloping line. The marginal policy benefit is
independent of the location of $\overline{p}$, and declines as the
Bureaucrat’s action approaches the Bureaucrat’s most-preferred action,
$a + \varepsilon$. The marginal cost curves depend on the location of
$\overline{p}$, and are depicted in Figure 11.2 by the three triangles
centered at $\overline{p}_1$, $\overline{p}_2$, and $\overline{p}_3$.
These triangles represent the function $\delta g(p - \overline{p})$. If
$\overline{p}$ were too high or too low, the marginal costs would be zero
at the Bureaucrat’s ideal intended action, $a + \varepsilon$. This would
lead the Bureaucrat to choose its ideal action. For non-extreme statutes,
the Bureaucrat’s best response lies at the intersection of the marginal
benefit curve with the relevant marginal cost curve.¹⁰ Given
$\overline{p}_1$, for example, the optimal action is $p^*_1$. For any
$p < p^*_1$, the marginal policy benefits of increasing the policy action
(toward the Bureaucrat’s most preferred) exceed the marginal compliance
costs, and for any $p > p^*_1$, the reduction in compliance costs of
moving the action away from the Bureaucrat’s most-preferred action
exceeds the policy losses.

⁹ The second line follows from a useful fact about expected utilities of
quadratic functions:
$\int_{-\infty}^{\infty} (\theta - \delta)^2 f(\delta)\,d\delta = (\theta - E(\delta))^2 + \mathrm{var}(\delta)$.

¹⁰ McCarty and Huber assume that $\Omega^2 > \delta$, which guarantees
that there is a unique intersection of the marginal benefit and cost
curves and that it represents a global maximum. Given their interest in
systems where bureaucratic capacity and the ability to sanction
non-compliance are low, this assumption seems reasonable.


Insert Figure 11.2 about here 


We can see from Figure 11.2 that the effect of changes in $\overline{p}$
on the Bureaucrat’s best-response depends on whether the apex of the
“compliance cost” triangle is to the left or right of the policy benefit
line (i.e., to the left or right of $\overline{p}_2$ in Figure 11.2). For
$\overline{p}_1 < \overline{p} < \overline{p}_2$, $p^* > \overline{p}$.
In this range, increases in $\overline{p}$ increase the marginal
compliance costs of any $p > \overline{p}$, inducing the Bureaucrat to
move toward the Politician’s ideal point. At $\overline{p}_2$, however,
this effect reverses. For $\overline{p} > \overline{p}_2$,
$p^* < \overline{p}$, and increases in $\overline{p}$ decrease the
marginal compliance cost of any $p < \overline{p}$, inducing the
Bureaucrat to adjust his action closer to his ideal point. Consequently,
the minimum action that the politicians can induce is given by
$p^*(\overline{p}_2)$.

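Huber and McCarty show that the formal solution to the agent’s
maximization problem is

$$p^* = \begin{cases}
a + \varepsilon & \text{if } \overline{p} - \varepsilon < a - \Omega \\
\dfrac{2\Omega^2(a + \varepsilon) - \delta(\overline{p} + \Omega)}{2\Omega^2 - \delta} & \text{if } a - \Omega \le \overline{p} - \varepsilon \le a - \dfrac{\delta}{2\Omega} \\
\dfrac{2\Omega^2(a + \varepsilon) + \delta(\overline{p} - \Omega)}{2\Omega^2 + \delta} & \text{if } a - \dfrac{\delta}{2\Omega} < \overline{p} - \varepsilon < a + \Omega \\
a + \varepsilon & \text{if } \overline{p} - \varepsilon \ge a + \Omega
\end{cases}$$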
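
A short sketch can confirm that the interior branches of this solution satisfy the agency's first order condition under the tent-shaped density. The function names and parameter values below are illustrative assumptions, and the sketch presumes the sanction is small relative to incapacity, so that the denominators are positive.

```python
def tent_density(w: float, omega: float) -> float:
    """Tent-shaped density on [-omega, omega]."""
    return max(omega - abs(w), 0.0) / omega**2


def best_response(p_bar: float, a: float, eps: float, omega: float, delta: float) -> float:
    """Piecewise best response; assumes 2 * omega**2 > delta."""
    target = a + eps
    gap = p_bar - eps
    if gap <= a - omega or gap >= a + omega:
        return target  # statute too strict or too lax to matter at the margin
    if gap <= a - delta / (2.0 * omega):
        return (2.0 * omega**2 * target - delta * (p_bar + omega)) / (2.0 * omega**2 - delta)
    return (2.0 * omega**2 * target + delta * (p_bar - omega)) / (2.0 * omega**2 + delta)


def foc(p: float, p_bar: float, a: float, eps: float, omega: float, delta: float) -> float:
    """Marginal policy benefit minus marginal sanction cost at intended policy p."""
    return 2.0 * (a + eps - p) - delta * tent_density(p - p_bar, omega)


# Illustrative values (assumed): a = 1, eps = 0, omega = 1, delta = 0.5.
a, eps, omega, delta = 1.0, 0.0, 1.0, 0.5
residuals = [foc(best_response(pb, a, eps, omega, delta), pb, a, eps, omega, delta)
             for pb in (0.25, 1.0)]
# Lowest intended policy the principal can induce over a grid of upper bounds.
lowest = min(best_response(k / 100.0, a, eps, omega, delta) for k in range(-100, 301))
```

With these values the lowest inducible policy is `a + eps - delta / (2 * omega)`, and it rises toward `a + eps` as `omega` grows.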


A few features of A’s best response are worth noting. Notice that for
extreme statutes ($\overline{p} < a - \Omega + \varepsilon$ or
$\overline{p} > a + \Omega + \varepsilon$), A’s best response is to
attempt to implement her ideal point. This is because if the statute is
too lax or too constraining, the marginal compliance cost at A’s ideal
policy is zero. Secondly, note that $l_m$ can only induce policies in the
interval $[a - \frac{\delta}{2\Omega} + \varepsilon, a + \varepsilon]$.
He cannot induce a lower policy because A would comply less often under a
more restrictive statute. Note that this minimum policy is increasing in
$\Omega$. Thus, $l_m$’s ability to control A decreases as capacity gets
lower. The intuition is that a low capacity bureaucracy is non-compliant
a large part of the time regardless of the policies it chooses and this
probability of non-compliance is not very responsive to the agency’s
choices. Therefore, the agency will choose to implement policies closer
to its ideal point since there is little additional penalty for doing so.
Thus, the model identifies one important effect of low bureaucratic
capacity: bureaucrats are harder to control through statutes.

Given their interest in the bureaucratic politics of low-capacity
systems, Huber and McCarty focus on the optimal statute when bureaucratic
capacity is sufficiently low.¹¹ They find that under these conditions the
optimal statute is


Just as in the Epstein-O’Halloran model, the Huber-McCarty model
predicts that L will delegate more authority when $E$ is larger. However,
its prediction about preference divergence is the exact opposite. In
the Huber-McCarty model, the statute is more permissive when $a$ and
$l_m = 0$ are further apart. This is a consequence of the fact that low
capacity bureaucrats are more likely to defect to their ideal point in
response to restrictive statutes. This defection is extremely costly to
L when $a$ is far from $l_m$. Thus, L is willing to grant more latitude
to extreme bureaucrats to give them stronger incentives to comply with
the statute. Finally, the Huber-McCarty model makes another prediction
at odds with standard models of bureaucratic delegation. Other
models have shown that when ex post sanctions are high, the principal
is willing to delegate more. In the Huber-McCarty model, high $\delta$ is
associated with a more restrictive statute. The rationale is
straightforward. High sanctions can induce even low capacity bureaucrats
to comply. Thus, L no longer needs to grant more discretion solely to
induce compliance.


4.3. Some Generalizations. Most game theoretic treatments of
delegation (including the ones outlined above) maintain a number of
stylized assumptions. First, principals and agents are assumed to
be risk averse. In fact in most applications, they are assumed to
have quadratic preferences. Secondly, the policy space and random
shock are assumed to be single dimensional. Finally, most models
assume that policy outcomes are additive functions of policy choices and
shocks. Bendor and Meirowitz (2004) note that while these simplifying
assumptions allow us to specify parsimonious models of the issues
at hand, they often come at a price in terms of generalizability. In
particular, the assumptions have limited our ability to determine
specifically which features are most important in the decision to
delegate, the selection of an agent, and the choice of monitoring and
control mechanisms. In this section, we sketch the key points of Bendor
and Meirowitz’s argument.


"Sufficiently low capacity" means Q > min {E 8, v5} 


2 @? 
Assume that there is a single principal and n subordinates. All
agents have ideal points in $\mathbb{R}^k$ and without loss of generality
the principal’s ideal point is assumed to be the 0 vector. Preferences
over outcomes are assumed to be represented by a utility function of the
form

$$u_i(x) = h(-\|x - y_i\|)$$

where $h(\cdot)$ is a strictly increasing continuous function, $\|z\|$ is
the Euclidean norm and $y_i$ is $i$’s ideal vector in $\mathbb{R}^k$.
Euclidean preference over a single dimensional policy is clearly a
special case of this assumption. As in Epstein-O’Halloran and
Huber-McCarty, Bendor and Meirowitz assume that the principal is less
informed than the subordinates, but they allow for (1) arbitrary
functional forms, and (2) heterogeneity in the uncertainty associated
with different policy selections. Formally this is captured by assuming
that for any policy, outcome $x(p)$ is a random variable given by the
conditional distribution $F(x \mid p)$. The principal knows only the
family of conditional distributions and informed subordinates know
deterministic mappings from $p$ into $x$. They assume that “perfect shock
absorption” is possible in that an informed agent can implement any
policy outcome $x$ by choosing the appropriate $p$. Epstein and
O’Halloran’s assumption that $x = p - \varepsilon$ is a special case of
this assumption. However, because of the implementation shocks in
McCarty and Huber, the shock absorption assumption holds only in
expectation.

Unlike the Huber-McCarty model, where bureaucratic capacity refers to
variation in the ability of agents to implement their intended policies,
Bendor and Meirowitz model capacity as variation in the agent’s
expertise. They assume that with probability $q_i$ agent $i$ learns the
random shock and can select $p$ to attain any $x$. However, with
probability $1 - q_i$ subordinate $i$ is uninformed, knowing no more than
the principal.

In their basic delegation model, the principal decides whether to
delegate or not. If she does not delegate, she selects policy based
on her priors about the policy shock. If she delegates, she selects an
agent and gives that agent complete discretion over the policy choice.
The selected agent chooses policy $p$, and the game ends. We now
highlight a few of the key findings. For now assume that all agents have
high capacity, $q_i = 1$ for all $i \in N$.


PROPOSITION 11.6. There exists a ball $B(\epsilon, 0)$ centered at 0 with
radius $\epsilon$ containing the ideal points of the agents to whom the
principal is willing to delegate. $B(\epsilon, 0)$ is known as the
delegation set.


The construction of $B(\epsilon, 0)$ is straightforward. If agent $i$ is
given control of policy, she would like to select a policy to enact the
outcome $x = y_i$. Accordingly, the principal is willing to delegate to
$i$ only if the outcome $y_i$ is preferred to the best lottery that the
principal can attain by selecting policy herself based on her priors. As
long as the principal faces uncertainty, her utility of implementing
policy herself, $u_0$, is less than the utility associated with reaching
$x = 0$ with probability 1, $h(0)$. This follows from the fact that
$x = 0$ is the principal’s ideal point so that any lottery on
$\mathbb{R}^k$ has lower expected utility than getting her ideal policy
$x = 0$ with certainty. So clearly, the principal will delegate to
an agent with ideal point $y_i = 0$. The set of ideal points that are
sufficiently close to 0 to merit delegation solve the inequality

$$h(-\|y_i\|) \ge u_0$$

and given that $h(\cdot)$ is strictly increasing and continuous, for any
value of $u_0$ the set $\{y : h(-\|y\|) \ge u_0\}$ is a closed ball.
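
To make the construction concrete, the sketch below finds the radius of the delegation set by bisection for an arbitrary strictly increasing utility function. The quadratic form and the value of the principal's fallback utility are assumptions chosen purely for illustration, and the function names are hypothetical.

```python
import math
from typing import Callable, Sequence


def delegation_radius(h: Callable[[float], float], u0: float, r_max: float = 100.0) -> float:
    """Largest r in [0, r_max] with h(-r) >= u0, found by bisection.

    Relies only on h being strictly increasing and continuous.
    """
    if h(0.0) < u0:
        raise ValueError("u0 exceeds h(0): no agent would be acceptable")
    if h(-r_max) >= u0:
        return r_max  # every agent within the search cap is acceptable
    lo, hi = 0.0, r_max
    for _ in range(200):  # fixed number of halvings; ample for double precision
        mid = 0.5 * (lo + hi)
        if h(-mid) >= u0:
            lo = mid
        else:
            hi = mid
    return lo


def in_delegation_set(y: Sequence[float], radius: float) -> bool:
    """True when the ideal point y lies in the closed ball of the given radius."""
    return math.sqrt(sum(c * c for c in y)) <= radius


# Illustrative assumption: quadratic loss and a fallback utility of -0.25,
# as would arise if acting alone exposes the principal to shock variance 0.25.
h_quadratic = lambda z: -(z**2)
radius = delegation_radius(h_quadratic, u0=-0.25)
```

Under these assumed values the radius equals the standard deviation of the shock, 0.5.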

Since so little structure has been placed on $h(\cdot)$, we see that the
informational rationale for delegation in spatial settings hinges only on
the desire to avoid bad outcomes: risk preferences, dimensionality, or
the nature of uncertainty are secondary issues. It is not difficult to see
that relaxing the assumption that all agents are perfectly competent
($q_i = 1$) does not have a qualitative effect. However, as competency
decreases the delegation set shrinks: if you are going to give authority
to someone else who is not likely to know anything more than you,
they had better have preferences very close to yours.

However, if several agents have ideal points in the delegation set, the
choice of whom to delegate to can be subtle. The traditional literature
has often stressed the ally principle, which says that if the principal
delegates, she will select the agent whose preferences most closely match
hers. With homogeneous competence ($q_i$ the same for all $i$) and the
stylized assumption that $x = p - \varepsilon$ (a multidimensional
version of the assumption in Epstein-O’Halloran), the ally principle
holds. The proof is left as an exercise.

However, with heterogeneity in $q_i$ or a more general policy outcome
function, the ally principle can fail. As an example of the first problem,
consider the case of $x = p - \varepsilon$ and two agents with
$\|y_1\| < \|y_2\|$. Would the principal ever choose to delegate to agent
2 instead of the more proximate agent 1? Bendor and Meirowitz show that
if $q_2 > q_1$ then possibly yes. We can conclude only that the principal
will never select an agent that is dominated by another agent, in the
sense that agent $i$ dominates agent $j$ if $\|y_i\| \le \|y_j\|$ and
$q_i \ge q_j$ with one of the inequalities strict. A more subtle finding
is that once we relax the assumption that $x = p - \varepsilon$, even if
$q_1 = q_2$ agent 2 may still be chosen over agent 1. The key to this
possibility is that the uncertainty associated with different policies
need not be the same. In the general model it is possible that an
uninformed agent 1’s most preferred policy, $p_1$, results in more
uncertainty than an uninformed agent 2’s most preferred policy, $p_2$.
This can be the case if attempting to enact certain types of outcomes
(say ones far from the status quo) is harder and thus subject to larger
possible errors than attempts to enact other types of outcomes (say
ones close to the status quo). As an example of this point, suppose the
policy and outcome spaces are $\mathbb{R}^1$ and consider two agents with
ideal points $y_1 = -1 + \delta$ and $y_2 = 1$. Thus, agent 1 is
closer to the principal. Suppose that $F(x \mid p)$ is as follows. If
$p > 0$ then $x = p + .1$ or $x = p - .1$ with equal probability and if
$p < 0$ then $x = p + .2$ or $x = p - .2$ with equal probability. In this
case, either agent selects $p = y_i$ if they do not learn the shock. This
means that agent 1’s uninformed policy choice entails more outcome risk.
Accordingly if the principal is risk averse (meaning $h(\cdot)$ is
strictly concave) and $\delta$ is sufficiently small then the principal
would prefer to delegate to agent 2 as opposed to 1 even though agent 1’s
ideal point is closer. An example of this argument is left as an exercise.
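
One numerical instance of this failure of the ally principle can be sketched as follows. The quadratic utility, the common competence level of one half, and the offset 0.005 for agent 1 are illustrative assumptions. The sketch also takes as given, following the text, that an uninformed agent simply selects the policy equal to her own ideal point.

```python
from typing import Callable


def principal_expected_utility(y: float, noise: float, q: float,
                               h: Callable[[float], float]) -> float:
    """Principal's expected utility (ideal point 0) from delegating to an agent
    with ideal point y who learns the shock with probability q.

    Informed agent: outcome is exactly y.
    Uninformed agent: outcome is y + noise or y - noise with equal probability.
    """
    informed = h(-abs(y))
    uninformed = 0.5 * h(-abs(y + noise)) + 0.5 * h(-abs(y - noise))
    return q * informed + (1.0 - q) * uninformed


h_quadratic = lambda z: -(z**2)  # strictly concave and increasing on z <= 0

offset = 0.005  # assumed: agent 1 sits at -1 + offset, slightly closer than agent 2
q = 0.5         # assumed common competence

eu_agent1 = principal_expected_utility(-1.0 + offset, noise=0.2, q=q, h=h_quadratic)
eu_agent2 = principal_expected_utility(1.0, noise=0.1, q=q, h=h_quadratic)
prefers_agent2 = eu_agent2 > eu_agent1
```

Setting `q = 1.0` removes the uninformed branch, and the closer agent 1 is again preferred.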

Another potential violation of the ally principle arises from
free-riding among agents when information acquisition is costly. Suppose
now that the shock can be learned at a cost $c$ by any agent or the
principal. If the cost is incurred, the shock is observed with probability
one. Further suppose that the principal can select an agent, giving her
authority, and then observe whether she invests the cost $c$ to learn the
shock. If the agent chooses not to learn then the principal can retake
control and decide whether to invest $c$ herself to learn the shock and
select policy. Alternatively the principal can select policy in ignorance.
In this setting it turns out that the delegation set is a
multidimensional doughnut. If the principal delegates to an agent with an
ideal point very close to her own, the agent has the choice of paying $c$
to get the outcome utility $h(0)$ or not investing, with the knowledge
that the principal will then retake control and invest $c$ herself. Thus,
learning implies utility $h(0) - c$ while free-riding yields utility
$h(-\|y_i\|)$. Thus for agents with ideal points closer to 0 than
$-h^{-1}(h(0) - c)$, free-riding on the principal is preferred.
Accordingly the principal will not delegate to agents that are very
close. Of course agents that are very distant will select undesirable
outcomes.


PROPOSITION 11.7. If information acquisition has cost $c$, then the
delegation set consists of agents in the original delegation set with
ideal points farther from 0 than $d = -h^{-1}(h(0) - c)$.
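
A small sketch illustrates the resulting doughnut shape. It assumes a quadratic utility function, for which the inner radius reduces to the square root of the cost. The outer radius is passed in as a given number, and all names and values are hypothetical.

```python
import math
from typing import Sequence


def free_ride_radius(c: float) -> float:
    """Inner radius d solving h(-d) = h(0) - c for the assumed h(z) = -z**2."""
    return math.sqrt(c)


def in_costly_delegation_set(y: Sequence[float], inner: float, outer: float) -> bool:
    """True when y is farther from 0 than the free-riding radius but still
    inside the original delegation ball."""
    dist = math.sqrt(sum(coord * coord for coord in y))
    return inner < dist <= outer


inner = free_ride_radius(0.04)  # assumed learning cost c = 0.04
outer = 0.5                     # assumed radius of the original delegation set

too_close = in_costly_delegation_set((0.1, 0.0), inner, outer)   # would free-ride
acceptable = in_costly_delegation_set((0.3, 0.0), inner, outer)
too_far = in_costly_delegation_set((0.4, 0.4), inner, outer)     # outside the ball
```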
Bendor and Meirowitz also consider the effect of competition by
agents in settings where there are many agents and one principal. Sup- 
pose that the agents are perfectly competent and simultaneously an- 
nounce outcomes in X. The principal then selects an agent and the 
agent selects policy (knowing the shock) to enact the outcome she an- 
nounced. If the outcome space is one dimensional and there are agents 
on either side of the principal this game looks like Downsian competi- 
tion and in every equilibrium at least two agents announce that they 
will enact the principal’s ideal outcome. It turns out that this conclu- 
sion holds regardless of the dimensionality of the policy space. 


DEFINITION 11.6. We say that preferences satisfy diversity if either
(i) there does not exist a vector $s \in X$ such that for all $i \in N$,
$y_i = \lambda_i s$ for some $\lambda_i \in \mathbb{R}^1$ or (ii) if such
a vector does exist then there must be two agents $i$ and $j$ with
$\lambda_i > 0$ and $\lambda_j < 0$.


Diversity requires that either preferences are not collinear or, if they
are, that there are agents on either side of the principal.


PROPOSITION 11.8. If preferences satisfy diversity then in every 
equilibrium at least two agents commit to enacting x = 0 and the prin- 
cipal accepts one of these offers. 


It is clear that if one agent promises to enact 0 the commitments
of the other agents are payoff irrelevant. So any strategy profile in
which at least 2 agents make this commitment is an equilibrium. Now
suppose that no agents are making this commitment. With at least
two agents, at least one of the agents can move the final outcome closer
to her ideal point by committing to an outcome that is closer to the
principal’s ideal point than the closest commitment of the remaining
agents. Thus there cannot be an equilibrium in which the principal does
not get her ideal outcome.


5. Mechanism Design and Signaling Games 


So far our presentation of mechanism design proceeded quite inde- 
pendently of our analysis of signaling. This isn’t surprising given that 
this is the way the literature has generally developed. However, we 
think much can be gained by exploring the connections between the 
two normally disjoint topics. 

We begin with a generic setup for a signaling model. Suppose
that the sender’s (player $s$) type is $\theta \in \Theta = [0,1]$ and the
message space is $M = [0,1]$. It is common knowledge that $\theta$ is
drawn from a distribution $F(\cdot)$ on $\Theta$. Following $s$’s message
$m \in M$, the receiver (player $r$) selects a policy $p \in X = [0,1]$.
Even without specifying payoffs, we can use the concept of incentive
compatibility to specify necessary conditions for the existence of an
equilibrium in which the sender’s message is fully-revealing such that
$m^{-1}(m(\theta)) = \theta$. In an equilibrium in which $m(\theta)$ is
one-to-one, consistent beliefs must be concentrated at the correct
$\theta$, i.e. beliefs can be represented by the probability distribution

$$B(\theta \mid m') = \begin{cases} 1 & \text{if } \theta \ge m^{-1}(m') \\ 0 & \text{otherwise} \end{cases}$$

Given a fully revealing message, sequential rationality by $r$ requires
that $p(m) \in P(\theta) = \arg\max_p u_r(p, \theta)$. Sequential
rationality by $s$ requires that she not have an incentive to mislead $r$
by behaving as if her type were $\theta'$ when it is $\theta''$. Given the
mapping $p(m)$, this incentive compatibility condition is
$u_s(p(\theta''), \theta'') \ge u_s(p(\theta'), \theta'')$ for all
$\theta', \theta'' \in \Theta$. Alternatively, the receiver’s best
response requires
$u_r(p(\theta''), \theta'') \ge u_r(p(\theta'), \theta'')$ for all
$\theta', \theta'' \in \Theta$. Thus, a requirement of a separating
equilibrium is that $p(\theta)$ maximize both $u_r(p, \theta)$ and
$u_s(p, \theta)$.


PROPOSITION 11.9. A separating PBE exists only if the preferences
of the players are similar: specifically, for every $\theta \in \Theta$ it
must be the case that
$\{\arg\max_{p \in X} u_r(p, \theta)\} \cap \{\arg\max_{p \in X} u_s(p, \theta)\}$
is non-empty.


Given this result, it is clear that truthful revelation in cheap talk
signaling with one sender requires strong similarity between sender and
receiver payoffs. As an example, recall our version of the open rule
Gilligan-Krehbiel model from chapter 8. There we showed that the
best response by F to informative signals is $p(-\theta) = \theta$ and
$p(\theta) = -\theta$. The proposition specifies that a separating
equilibrium exists if and only if

$$u_F(\theta \mid -\theta) \ge u_F(-\theta \mid -\theta)$$
$$u_F(-\theta \mid \theta) \ge u_F(\theta \mid \theta)$$
$$u_C(\theta \mid -\theta) \ge u_C(-\theta \mid -\theta)$$
$$u_C(-\theta \mid \theta) \ge u_C(\theta \mid \theta)$$

We know the first two inequalities hold since $p(-\theta) = \theta$ and
$p(\theta) = -\theta$, so the crucial conditions are
$-c^2 \ge -(2\theta + c)^2$ and $-c^2 \ge -(2\theta - c)^2$.
Note that these both hold if $c \le \theta$. This is exactly the condition
of preference divergence that we derived before.
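
The two committee conditions can be checked directly. The sketch below assumes quadratic utilities, a positive state magnitude and a nonnegative committee ideal point, and uses a hypothetical function name.

```python
def committee_reveals(theta: float, c: float) -> bool:
    """True when both committee incentive conditions hold.

    Truthful reporting yields outcome 0 and utility -c**2. Misreporting shifts
    the outcome by 2*theta in one direction or the other.
    """
    truthful = -(c**2)
    lie_when_state_low = -((2.0 * theta + c) ** 2)
    lie_when_state_high = -((2.0 * theta - c) ** 2)
    return truthful >= lie_when_state_low and truthful >= lie_when_state_high


# On a grid of exactly representable values, the conditions hold exactly
# when the committee's ideal point c does not exceed theta.
grid = [k / 4.0 for k in range(0, 13)]
matches_cutoff = all(
    committee_reveals(theta, c) == (c <= theta)
    for theta in grid if theta > 0.0
    for c in grid
)
```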

Now we extend the general model so that there are two senders 1
and 2 who each observe $\theta$ but have possibly different preferences.
The question of whether there is a PBE in which the receiver learns
$\theta$ can be viewed as a type of mechanism design problem where the
receiver’s choice of a mapping $p(m_1, m_2) : M^2 \to X$ is analogous to
selecting a mechanism. In contrast to mechanism design, a PBE of a
signaling game requires that the receiver’s decision be sequentially
rational given consistent beliefs. Thus, we are not free to choose just
any mechanism that satisfies the senders’ incentive compatibility
conditions. However, Baron and Meirowitz (2004) show that in many cases
the constraint that the receiver’s actions must be sequentially rational
is not very limiting.

Returning for the moment to the mechanism design problem, suppose
the receiver wishes to induce truthfulness by punishing the senders if
their messages do not coincide. Suppose that there exists a bad policy
$p^b$ which both senders like less than the receiver’s best response to
any truthful pair of messages. Formally, $p^b$ is defined so that for
every $\theta$

$$\begin{aligned}
u_1(\arg\max_{p \in X} u_r(p, \theta), \theta) &> u_1(p^b, \theta) \\
u_2(\arg\max_{p \in X} u_r(p, \theta), \theta) &> u_2(p^b, \theta).
\end{aligned} \tag{11.21}$$

Given this definition of $p^b$, the following mechanism satisfies the
incentive compatibility conditions for both senders to be truthful:

$$p(m_1, m_2) = \begin{cases}
\arg\max_{p \in X} u_r(p, m) & \text{if } m_1 = m_2 = m \\
p^b & \text{if } m_1 \ne m_2
\end{cases}$$

Given this policy function, condition 11.21 implies that sender 2’s best
response to a truthful announcement by sender 1 is a truthful
announcement to avoid the dreaded $p^b$. A similar argument applies for
sender 1’s incentive to be truthful. Within the mechanism design
framework, the mere existence of $p^b$ is enough to induce truthfulness
regardless of the receiver’s utility from $p^b$. However, in signaling
models, we need to worry about whether choosing $p^b$ is sequentially
rational for the receiver. Accordingly it must be the case that $p^b$ is
an optimal policy for the receiver given the beliefs she forms at all
information sets where $m_1 \ne m_2$. This suggests that our mechanism
design trick will not be very compelling to those committed to the
signaling tradition. But recall that weak consistency only constrains
beliefs at information sets that occur with positive probability. In an
equilibrium in which the senders are truthful, $m_1 \ne m_2$ does not
occur. Accordingly, satisfying sequential rationality and credibly
committing to enact $p^b$ if $m_1 \ne m_2$ is not that challenging. All
that is required is that there exists some distribution $b^0(\cdot)$ on
$\Theta$ such that
$p^b \in \arg\max_p \int u_r(p, \theta)\,d b^0(\theta)$. Adding a state
$\theta^0$ in which $p^b \in \arg\max_p u_r(p, \theta^0)$ would suffice.
From this argument we reach the following conclusion.

THEOREM 11.2. (Baron and Meirowitz) With at least two senders
that observe $\theta$, as long as there is a policy $p^b \in X$ satisfying
condition 11.21 and a distribution $b^0(\cdot)$ on $\Theta$ such that
$p^b \in \arg\max_p \int u_r(p, \theta)\,d b^0(\theta)$, a truthful PBE
exists.
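
The incentive side of this argument can be illustrated on a small discrete example. Everything in the sketch below is an assumption made for illustration: the quadratic payoffs, the sender biases, the grid of states and the candidate punishment policies. It checks only the senders' incentive compatibility, not the receiver's sequential rationality.

```python
from typing import Iterable, List


def receiver_best(theta: float) -> float:
    """Receiver's best response when she believes the state is theta (assumed)."""
    return theta


def sender_utility(p: float, theta: float, bias: float) -> float:
    """Assumed quadratic sender payoff with ideal policy theta + bias."""
    return -((p - theta - bias) ** 2)


def mechanism(m1: float, m2: float, p_bad: float) -> float:
    """Follow the common message when the reports agree; otherwise punish."""
    return receiver_best(m1) if m1 == m2 else p_bad


def satisfies_11_21(states: Iterable[float], biases: Iterable[float], p_bad: float) -> bool:
    """Both senders strictly prefer the receiver's best response to p_bad in every state."""
    biases = list(biases)
    return all(
        sender_utility(receiver_best(t), t, b) > sender_utility(p_bad, t, b)
        for t in states for b in biases
    )


def truthful_is_best_response(states: List[float], bias: float, p_bad: float) -> bool:
    """With the other sender truthful, no misreport beats telling the truth."""
    for theta in states:
        honest = sender_utility(mechanism(theta, theta, p_bad), theta, bias)
        for m in states:
            if sender_utility(mechanism(m, theta, p_bad), theta, bias) > honest:
                return False
    return True


states = [k / 10 for k in range(3, 10)]  # assumed grid of states 0.3, ..., 0.9
biases = [0.1, -0.1]                      # assumed opposed sender biases

good_punishment = satisfies_11_21(states, biases, p_bad=0.0)
bad_punishment = satisfies_11_21(states, biases, p_bad=0.5)
```

With the punishment policy at 0.0 both senders report truthfully. With 0.5 the condition fails, because a sender can prefer the "punishment" to the receiver's best response in some states.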


In response to Gilligan and Krehbiel’s (1989) work on heterogeneous
legislative committees, Krishna and Morgan (2001) demonstrated that
with two senders there are PBE in which the receiver learns the state
$\theta$. Their equilibrium did not hinge on a punishment policy $p^b$
but rather used out-of-equilibrium beliefs to rationalize a policy that
punished any agent that would have an incentive to lie. Krehbiel (2001)
criticized this approach on the grounds that off-the-path responses to
some messages were highly discontinuous and tended to move in the wrong
direction. While in equilibrium, high policies are best responses to low
messages, Krishna and Morgan’s PBE calls for low policies in response to
high out-of-equilibrium messages.

However, Battaglini (2002) showed that if in fact the policy and
state spaces are multidimensional ($X = \Theta = [0,1]^2$) then truthful
equilibria can be constructed that do not depend on beliefs in such a
peculiar manner. As an example of this model, consider a receiver and
two senders, all with Euclidean preferences over two dimensions. The
receiver has an ideal point of $(0,0)$, sender 1 has ideal point
$(1,0)$, and sender 2 has ideal point $(0,1)$. In this model, each
sender observes the shock $\theta \in \Theta$ perfectly, and the
outcomes $x$ are a random function of policy $p$ where $x = p + \theta$.
We denote a message by sender $s \in \{1,2\}$ by the vector
$m_s = (m_s^1, m_s^2)$. Similarly, outcomes and policies are vectors
($x = (x^1, x^2)$, $p = (p^1, p^2)$). A particularly simple direct
mechanism can be constructed once we realize that while neither sender
has the same preferences as the receiver (and thus by Proposition 11.9
there is not a truthful equilibrium in the one-sender game), each sender
has the same preferences as the receiver over a particular dimension of
the problem. Sender 1 and the receiver both want the second coordinate
of the outcome as close to 0 as possible, and sender 2 and the receiver
both want the first coordinate of the outcome as close to 0 as possible.
Suppose that sender 1, with ideal point $(1,0)$, is given complete
control of the second coordinate, so that $p^2 = -m_1^2$, and that
sender 2, with ideal point $(0,1)$, is given complete control of the
first coordinate, so that $p^1 = -m_2^1$. Given this mechanism,
announcement of $m_s(\theta) = \theta$ is a best response, and thus the
mechanism is incentive compatible. Can this receiver strategy be
supported in a PBE? If so, we need to find weakly consistent beliefs for
which this policy function is sequentially rational. Since the mapping
$p(m)$ described above is a direct mechanism, at every information set
that is reached Bayes’ rule results in concentrated beliefs
$\theta = m_1 = m_2$. It remains to specify beliefs for information sets
in which $m_1 \neq m_2$ that make the policy function $p(m)$ sequentially
rational for the receiver and make the truthful messages sequentially
rational for the senders. One easy way to do this is to let the beliefs
ignore $m_1^1$ and $m_2^2$. In other words, the beliefs are concentrated
at $\theta = (m_2^1, m_1^2)$. Battaglini shows that in the Euclidean
preferences setting with at least 2 dimensions and 2 senders, separating
PBE exist as long as the ideal points of the 3 players are not on a
line.
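
The incentive argument for the two-sender example can be checked numerically. The following Python sketch is only an illustration under assumptions of our own: utilities are quadratic Euclidean losses, the state is fixed at $\theta = (0.3, 0.7)$, and deviations are restricted to a finite grid. All function names are hypothetical.

```python
# Sketch of the two-sender, two-dimensional mechanism (assumed quadratic loss).

def policy(m1, m2):
    """Sender 2's report sets coordinate 1, sender 1's report sets coordinate 2."""
    return (-m2[0], -m1[1])

def outcome(theta, m1, m2):
    """Outcome x = p + theta."""
    p = policy(m1, m2)
    return (p[0] + theta[0], p[1] + theta[1])

def utility(ideal, x):
    """Negative squared Euclidean distance from the player's ideal point."""
    return -((x[0] - ideal[0]) ** 2 + (x[1] - ideal[1]) ** 2)

def truthful_is_best_response(theta, grid):
    """Check that no unilateral lie on the grid beats truth-telling."""
    ideal1, ideal2 = (1.0, 0.0), (0.0, 1.0)
    u1_truth = utility(ideal1, outcome(theta, theta, theta))
    u2_truth = utility(ideal2, outcome(theta, theta, theta))
    for a in grid:
        for b in grid:
            lie = (a, b)
            # Sender 1 deviates while sender 2 is truthful, and vice versa.
            if utility(ideal1, outcome(theta, lie, theta)) > u1_truth + 1e-12:
                return False
            if utility(ideal2, outcome(theta, theta, lie)) > u2_truth + 1e-12:
                return False
    return True

grid = [i / 10 for i in range(11)]
theta = (0.3, 0.7)
print(outcome(theta, theta, theta))            # receiver's ideal point when both are truthful
print(truthful_is_best_response(theta, grid))
```

Each sender only moves the coordinate on which she agrees with the receiver, so a lie can only push that coordinate away from 0 without affecting the coordinate she would like to change.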

So we have seen that under particular types of preferences, separating
PBE exist with 2 senders. With three perfectly informed senders
$\{1,2,3\}$ we do not need to make any assumptions about preferences, as
a particularly simple separating PBE can be characterized. Let
$p^*(\theta)$ denote a selection from the correspondence
$\arg\max_p u_r(p,\theta)$ and suppose that the default policy $p^*$
equals $p^*(\theta')$ for some state $\theta'$. Similar to above,
suppose the receiver could commit to the following mechanism that
depends on the messages $m_1, m_2, m_3$:


$$p(m) = \begin{cases} p^*(\theta) & \text{if } \theta = m_i = m_j \text{ for some distinct } i,j \in \{1,2,3\} \\ p^* & \text{otherwise.} \end{cases}$$

Then truthful messages are a best response because if $i$ and $j$ are
truthful then $k$’s message is outcome inconsequential. Supporting this
policy function as sequentially rational is easy. Any belief mapping
that is concentrated on $\theta$ if $m_1 = m_2 = m_3 = \theta$ is weakly
consistent, and any beliefs that are concentrated at $\theta$ if
$\theta = m_i = m_j$ for some distinct $i,j \in \{1,2,3\}$ make the
policy mapping a best response for the receiver.
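
The claim that a third sender's message is outcome inconsequential when the other two are truthful can be illustrated with a short Python sketch. The finite state space, the receiver's optimal-policy rule `p_star`, and the default policy are hypothetical choices made only for this illustration.

```python
from collections import Counter

def mechanism(messages, p_star, default):
    """Return p_star(theta) if at least two of the three messages agree on theta."""
    theta, count = Counter(messages).most_common(1)[0]
    return p_star(theta) if count >= 2 else default

def unilateral_deviation_matters(theta, states, p_star, default):
    """True if some single sender can change the policy by lying about theta."""
    truthful = mechanism((theta, theta, theta), p_star, default)
    for k in range(3):
        for lie in states:
            msgs = [theta] * 3
            msgs[k] = lie
            if mechanism(tuple(msgs), p_star, default) != truthful:
                return True
    return False

states = [0, 1, 2, 3]                 # hypothetical finite state space

def p_star(theta):
    """Hypothetical receiver optimum: offset the shock exactly."""
    return -theta

default = p_star(0)                   # default policy, optimal in state 0

print(unilateral_deviation_matters(2, states, p_star, default))
```

Because any single lie leaves two matching truthful reports, the policy is unchanged, which is why no assumption on sender preferences is needed.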

The conclusion of Baron and Meirowitz is that any direct mecha- 
nism that (1) selects an optimal policy for the receiver following truth- 
ful messages and (2) following non-truthful messages selects a policy 
which is optimal given some belief on © can be supported in a PBE to 
the signaling game. 


6. Exercises 


EXERCISE 11.1. Suppose that instead of wishing to maximize the welfare
of her department, the department chair wanted to maximize her surplus
(contributions minus expenditures on the espresso maker). Would she
still want to implement the Groves-Clarke mechanism?


EXERCISE 11.2. Suppose that a polling mechanism were used such that
$x(m) = \frac{1}{n}\sum_i m_i$, so that the average ideal point is the
implemented policy. Show that this mechanism is not strategy-proof.


EXERCISE 11.3. Prove Moulin’s result (Proposition 11.2).


EXERCISE 11.4. Prove that the strategies described by equation 11.1 
form an equilibrium to the first price auction. 


EXERCISE 11.5. Demonstrate that use of the strategy, "Hold up
placard until the price exceeds $b(\theta)$" constitutes an equilibrium
in the descending price auction.


EXERCISE 11.6. Find the expected revenue of a standard auction if 
F(-) is the uniform distribution on [0,1]. 


EXERCISE 11.7. Consider Ferejohn’s model with moral hazard. Assume
that equation 11.20 does not hold. Construct a mixed strategy
equilibrium where the government sometimes chooses a = low and the
voter always removes the government when x = low and occasionally
removes it when x = high.


EXERCISE 11.8. Prove that in the Epstein-O’Halloran model $P^*$ will
always be a closed interval $[\underline{p}, \overline{p}] \subset X$.


EXERCISE 11.9. In the Epstein-O’Halloran model, show that the
majority rule outcome for $P^*$ is that preferred by the legislator with
ideal point $l_m$.


EXERCISE 11.10. In the Epstein-O’Halloran model, assume that $\epsilon$
is distributed $N(0,\sigma^2)$. Compute the optimal statute $P$. How
does $P$ depend on $l_m$, $a$, and $\sigma$? Hint:
$E(\epsilon \mid \epsilon < m) = -\sigma \frac{\phi(m/\sigma)}{\Phi(m/\sigma)}$.
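
The truncated-mean formula used in the hint, $E(\epsilon \mid \epsilon < m) = -\sigma\,\phi(m/\sigma)/\Phi(m/\sigma)$ for $\epsilon \sim N(0,\sigma^2)$, is a standard fact that can be checked by numerical integration. The Python sketch below does only that; the function names and the midpoint-rule integration are our own illustrative choices, not part of the model.

```python
import math

def phi(z):
    """Standard normal density."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)

def Phi(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def truncated_mean_formula(m, sigma):
    """Closed form for E(eps | eps < m) when eps ~ N(0, sigma^2)."""
    return -sigma * phi(m / sigma) / Phi(m / sigma)

def truncated_mean_numeric(m, sigma, steps=100_000):
    """Midpoint-rule approximation of the same conditional expectation."""
    lower = min(m, 0.0) - 12.0 * sigma   # the mass below this point is negligible
    h = (m - lower) / steps
    num = den = 0.0
    for i in range(steps):
        e = lower + (i + 0.5) * h
        dens = phi(e / sigma) / sigma
        num += e * dens * h
        den += dens * h
    return num / den

print(truncated_mean_formula(0.5, 2.0), truncated_mean_numeric(0.5, 2.0))
```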


EXERCISE 11.11. Augment the Epstein-O’Halloran model by assuming that
governor $G$ with ideal point $g > l_m$ appoints $A$ (i.e., selects $a$)
prior to $L$ choosing $P$. Assume that $A$ learns $\epsilon$ but that $G$
and $L$ believe that $\epsilon$ is distributed uniformly on $[-E, E]$.
Show that the governor’s optimal appointment is $a^* \in (l_m, g)$. How
does $a^*$ depend on $l_m$ and $E$? What if the governor appoints $A$
after $P$ is selected?


EXERCISE 11.12. In the context of the Bendor-Meirowitz model,
prove that $\{y : h(\|y\|) < u_p\}$ is a closed ball.


EXERCISE 11.13. In the Bendor-Meirowitz model, show that if $q_i$ is
the same for all $i$ and $x = p - \epsilon$ (with $\epsilon$ having
distribution $F(\cdot)$ on $\mathbb{R}^d$), the ally principle holds.


EXERCISE 11.14. In the Bendor-Meirowitz model, suppose that
$q_1 = q_2 = 2$, and that $u(x) = -x^2$. Find the minimum value of
$\delta$ such that the principal prefers delegation to agent 2.


EXERCISE 11.15. Construct a fully separating PBE in a version of
the Battaglini model in which sender 1 has ideal point $(1,1)$ and
sender 2 has ideal point $(0,1)$.


CHAPTER 12 


Mathematical Appendix 


Mathematics can be thought of as a language constructed to facil- 
itate the derivation of logical propositions from basic axioms. Math- 
ematical propositions are “true” if and only if the underlying axioms 
and assumptions are “true”. Euclidean geometry is the set of all state- 
ments that are true if parallel lines never intersect. Other geometries 
are based on different axioms and generate other theorems. 

Mathematical arguments have two main advantages over forms of 
non-formal argumentation: 


• The basic assumptions of the proponent are more clearly laid
out. This makes it clear which of the propositions are logically 
derived and which are assumed to be true. 


In many non-formal theories of politics, it is not clear what types 
of assumptions about individual and institutional behavior are being 
made to produce the empirical predictions. The result is often
predictions incompatible with any coherent theory of behavior. In
mathematical models, the “micro-foundations” are well specified. The
foundations are no more or less appropriate when they are formalized. 
However, understanding of the relationships between foundations and 
findings is made easier by the use of this type of reasoning. 

• It is often easier to derive logical propositions within the
framework of mathematics than outside it.

While mathematics was designed and has evolved to economize on 
the production of logical arguments, English and other spoken lan- 
guages serve so many other purposes that their ability to produce log- 
ical argument is compromised. 


0.1. Mathematical Statements and Proofs. Just as in any other
language, the fundamental unit of mathematics is the statement, which
for now we denote $P$. Here we review the types of mathematical
statements that readers are likely to encounter.

: Universal Statement: P is always true within a given mathe- 
matical system. 

Consider the following example of a universal statement.
: Let $x$ be a real number. $\forall x$, $x \le |x|$, where the symbol
$\forall$ means “for all” or “for every”.


To prove such a statement, we require it to be proven for a generic 
x where we can only use the properties common to every value of x. 


: Existential Statement: There are conditions under which P is 
true. 


The following is such an example:
: $\exists x$ such that $x = |x|$, where $\exists$ means “there exist(s).”


To prove an existential statement, we need only find a value of $x$
in the given system for which $P$ is true. Of course in this example,
$x \ge 0$ is the needed condition.

Mathematics also has very well defined procedures for verifying that
a given statement is true. Now we consider the types of proofs that
will be encountered in the text.
Deduction. Proofs by deduction are those in which a statement is true
because it is logically connected to a statement known or presupposed
to be true. Suppose we know $P$ to be true; then we can establish that
$Q$ is true if we can prove “if $P$, then $Q$” or $P \Rightarrow Q$.
Obviously, this can take place via a number of steps like:


$P \Rightarrow R$
$R \Rightarrow S$
$S \Rightarrow Q$


Sometimes when we work it out in our mind, showing $S \Rightarrow Q$ may
be the first step. However, when communicating the logic of the proof,
it should be written in the order given above. Deduction can be used to
prove both existential and universal statements.

The Contrapositive. Sometimes it is easier to establish $P \Rightarrow Q$
by formulating it in terms of the negative statements $\sim Q$ and
$\sim P$, where $\sim$ means “not”. It is logically true that
$(\sim Q \Rightarrow \sim P) \Rightarrow (P \Rightarrow Q)$.


EXAMPLE 12.1. If 7m is an odd number, then m is an odd number. 


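Thus, $P = \{7m \text{ is odd}\}$, $Q = \{m \text{ is odd}\}$,
$\sim P = \{7m \text{ is even}\}$, and $\sim Q = \{m \text{ is even}\}$.
We wish to show $P \Rightarrow Q$ by showing $\sim Q \Rightarrow \sim P$.

$\sim Q \Rightarrow m = 2k$ for some integer $k$
$\Rightarrow 7m = 7(2k)$
$\Rightarrow 7m = 2(7k)$
$\Rightarrow 7m = 2n$ for some integer $n$
$\Rightarrow \sim P$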


Contradiction. One way to prove that the statement $P$ is true is to
demonstrate that $\sim P$ is false. Proving that a statement is false is
quite easy: we only need to provide a counterexample. One counterexample
will show $\sim P$ to be false and thus that $P$ is true. Note that this
procedure works best when we are arguing against a universal statement
or in favor of an existential statement.


EXAMPLE 12.2. Let $n$ be any integer and let $P$ = {there exists
$n > 0$ such that $n^2 + n + 17$ is not a prime number} or
$P = \{\exists n > 0 \ni n^2 + n + 17 \text{ is not a prime number}\}$,
where $\exists$ means “there exists” and $\ni$ means “such that.”


We can now construct
$\sim P = \{\forall n > 0,\ n^2 + n + 17 \text{ is a prime number}\}$.
But $\sim P$ is false since $n = 17$ implies that
$n^2 + n + 17 = 19 \cdot 17$ and is thus not prime. Thus, $P$ is true.

We may also establish ~P is false by deriving a series of implications 
from ~P that lead to a false statement. 

$\sim P \Rightarrow n + 1 + 17/n$ is not an integer for all $n$
$\Rightarrow 17/n$ is not an integer for all $n$
$\Rightarrow 1$ is not an integer

The final statement is obviously false. 
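

A counterexample of the kind used in Example 12.2 can also be found by direct search. The Python sketch below is an illustration only; the helper names are ours. It looks for the smallest positive integer $n$ for which $n^2 + n + 17$ fails to be prime, and confirms the factorization used in the text for $n = 17$.

```python
def is_prime(k):
    """Trial-division primality test, adequate for small integers."""
    if k < 2:
        return False
    i = 2
    while i * i <= k:
        if k % i == 0:
            return False
        i += 1
    return True

def first_counterexample(limit=1000):
    """Smallest n in 1..limit with n^2 + n + 17 composite, or None if there is none."""
    for n in range(1, limit + 1):
        if not is_prime(n * n + n + 17):
            return n
    return None

n = first_counterexample()
print(n, n * n + n + 17)
print(17 * 17 + 17 + 17 == 19 * 17)
```

Any single counterexample suffices to show that the universal statement $\sim P$ is false; the search simply reports the first one it encounters.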


1. Sets and Functions 


1.1. Sets. A set is a collection of distinct objects, be they numeric
values or anything else. The objects of sets are called elements. We
may denote sets either by enumerating them or by describing them.
Suppose $S$ is the set of all positive integers less than 10. We may
write $S$ as $S = \{1,2,3,4,5,6,7,8,9\}$ or
$S = \{n \mid n \text{ is an integer and } 0 < n < 10\}$. The second
statement is read as “$S$ is the set of all numbers that are integers
and are greater than 0 and less than 10.” Sets can have either a finite
number of elements, as above, or an infinite number, such as
$I = \{n \mid n \text{ is an integer greater than } 7\}$ or
$J = \{x \mid 0 < x < 1\}$. Infinite sets are either countable or
uncountable. $I$ is countable since an integer can be associated with
each element. $J$, on the other hand, is uncountable since it is
impossible to associate an integer with each element. We can prove that
$J$ is uncountable as follows. Associate elements in $J$ with integers
such that $x_1$ is the first element, $x_2$ is the second element, and
so forth. As long as $x_1 < x_2$ there exists an $x$ that is an element
of $J$ such that $x_1 < x < x_2$ which is not associated with an integer.
This is a contradiction.


EXAMPLE 12.3. A rational number is one that can be written as a
fraction $p/q$ where $p$ and $q$ are integers with $q \neq 0$. Prove
that $Q = \{x \mid x \text{ is rational}\}$ is infinite but countable.


We may denote that some element belongs to a particular set with
the symbol “$\in$”. Therefore, the following are truthful statements:
$3 \in S$, $9 \in I$, $0.345678 \in J$, and $0.8 \in Q$. We can also
designate which elements are not in a particular set with “$\notin$” so
that $10 \notin S$, $7 \notin I$, $0 \notin J$, and $\pi \notin Q$.


1.2. Set Relations and Operations. The following are some 
useful relationships between different sets. 

Equality: $S_1 = S_2$ implies that $x \in S_1$ if and only if
$x \in S_2$. In other words, if $S_1 = S_2$, $S_1$ and $S_2$ contain
exactly the same elements.
Subset: $S_1$ is said to be a subset of $S_2$, or $S_1 \subseteq S_2$,
if for all $x \in S_1$, $x \in S_2$. Thus, $S_1$ is a subset of $S_2$ if
all of the elements of $S_1$ are also in $S_2$. Note that if
$S_1 = S_2$, then $S_1 \subseteq S_2$ and $S_2 \subseteq S_1$. We can
also define a “proper” subset which rules out the possibility of
equality. $S_1 \subset S_2$ implies that for all $x \in S_1$,
$x \in S_2$ and there exists $y \in S_2$ such that $y \notin S_1$. In
other words, all elements of $S_1$ are in $S_2$ but $S_2$ has additional
elements not found in $S_1$. We may use the symbols $\supseteq$ and
$\supset$ to write such statements in the opposite order. Finally, the
symbol $\not\subset$ means “not a subset of.”

Disjoint: Two sets are said to be disjoint if they have no elements in
common. Formally, $S_1$ and $S_2$ are disjoint if $x \in S_1$ implies
that $x \notin S_2$.

There is a special set $\emptyset$ known as the “null set” defined by
the following properties: $\emptyset$ contains no elements and for all
$S$, $\emptyset \subseteq S$. Clearly, $\emptyset$ is the smallest
possible set. There is also a largest set $U$ such that for all $S$,
$S \subseteq U$. We will call this set the universal set.

A number of mathematical operations on sets will be useful. 
Unions: The union of two or more sets is the set of elements contained
in at least one of the sets. Formally, we write the union of $S_1$ and
$S_2$ as $S_1 \cup S_2 = \{x \mid x \in S_1 \text{ or } x \in S_2\}$. We
can write the union of a large number of sets indexed by $i$ as
$\bigcup_{i=1}^{N} S_i = S_1 \cup S_2 \cup \ldots \cup S_N$.
Intersections: The intersection of two or more sets is the set of
elements common to all of the sets. Formally, we write the intersection
of $S_1$ and $S_2$ as
$S_1 \cap S_2 = \{x \mid x \in S_1 \text{ and } x \in S_2\}$. We can
write the intersection of a large number of sets indexed by $i$ as
$\bigcap_{i=1}^{N} S_i = S_1 \cap S_2 \cap \ldots \cap S_N$. If $S_1$
and $S_2$ are disjoint, $S_1 \cap S_2 = \emptyset$. Since they have no
elements in common, the intersection must be the null set.
Complements: If we have a universal set $U$ and a subset $S$ we can
define the complement of $S$ as
$S^c = U \setminus S = \{x \mid x \in U \text{ and } x \notin S\}$.

Let $A$, $B$, $C$, and $D$ be sets. Then the operations on these sets
must satisfy the following properties.


: Commutative: $A \cup B = B \cup A$ and $A \cap B = B \cap A$

: Associative: $(A \cup B) \cup C = A \cup (B \cup C)$ and
$(A \cap B) \cap C = A \cap (B \cap C)$

: Distributive: $A \cup (B \cap C) = (A \cup B) \cap (A \cup C)$ and
$A \cap (B \cup C) = (A \cap B) \cup (A \cap C)$
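

The set operations and the properties listed above can be tried out directly with Python's built-in `set` type, where `|` is union, `&` is intersection, and `-` is set difference. The particular sets below are arbitrary examples of our own, and checking one example illustrates the laws rather than proving them.

```python
A = {1, 2, 3}
B = {3, 4}
C = {2, 4, 5}
U = A | B | C | {6}              # a universal set chosen for this example

def complement(S):
    """Complement of S relative to the universal set U."""
    return U - S

laws = {
    "commutative": ((A | B) == (B | A)) and ((A & B) == (B & A)),
    "associative": (((A | B) | C) == (A | (B | C)))
                   and (((A & B) & C) == (A & (B & C))),
    "distributive": ((A | (B & C)) == ((A | B) & (A | C)))
                    and ((A & (B | C)) == ((A & B) | (A & C))),
    "de_morgan": (complement(A | B) == (complement(A) & complement(B)))
                 and (complement(A & B) == (complement(A) | complement(B))),
}

for name, holds in laws.items():
    print(name, holds)
```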


1.3. Correspondences and Functions. Another way to relate two sets to
one another is to specify which elements of each set “correspond” or
“go with each other”. In general a correspondence is a rule, $f$, that
links the elements of $S_1$ to $S_2$. Formally, we may write
$f : S_1 \rightrightarrows S_2$ where the set $S_1$ is called the domain
(or pre-image) set and $S_2$ is the range (or image) set. As an example,
consider Figure 12.1 where the correspondence $f$ relates $x \in S_1$ to
elements $y, z \in S_2$.


Insert Figure 12.1 Here 


A function is a special type of correspondence which relates each
element of the domain to a unique point of the range. So if the
correspondence $f : S_1 \rightrightarrows S_2$ is actually a function,
for every $x \in S_1$ it is the case that $f(x)$ is a single element of
$S_2$. For functions we drop one of the arrows and write
$f : S_1 \to S_2$. With a function, multiple points from the domain may
map into the same point in the range. In Figure 12.2, the function
relates $x, w, v \in S_1$ to elements $y, z \in S_2$. Since each element
of the domain maps into a single element in the range, we may without
ambiguity write a function as $y = f(x)$ for $x \in S_1$ and
$y \in S_2$. We may also represent the function by a set of ordered
pairs such as
$\{(y,x) \mid y = f(x) \text{ for all } x \in S_1 \text{ and } y \in S_2\}$.


Insert Figure 12.2 Here 
Consider a function $f : A \to B$. Two important properties are:


: Injectivity: For all $a_1$ and $a_2 \in A$, $f(a_1) = f(a_2)$ if and
only if $a_1 = a_2$. Each point in the range is associated with a single
point in the domain. This property is also known as “one-to-one”.

: Surjectivity: $\forall b \in B$ there exists $a \in A$ such that
$f(a) = b$. This property is also known as “onto”.


If a function has these two properties it is known as a bijection and
there exists an inverse function mapping points in $B$ to points in $A$.
We can write this inverse function as $f^{-1} : B \to A$ or
$f^{-1}(b) = a$ for $b \in B$ and $a \in A$.

EXAMPLE 12.4. $y = 2x$ is a bijection. Since every $y$ maps to a
single $x$, we can write the inverse function $f^{-1}(y) = y/2$.


EXAMPLE 12.5. $y = x^2$ is not injective since $x$ and $-x$ produce the
same $y$. However, if we restrict the domain to positive numbers, we
can write $f^{-1}(y) = \sqrt{y}$.
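

On finite sets, injectivity and surjectivity can be checked by enumeration. The sketch below is a hedged illustration: the finite domains and codomains are our own stand-ins for the real line used in Examples 12.4 and 12.5, and the helper names are hypothetical.

```python
def is_injective(f, domain):
    """No two distinct domain points share an image."""
    images = [f(a) for a in domain]
    return len(set(images)) == len(images)

def is_surjective(f, domain, codomain):
    """Every point of the codomain is the image of some domain point."""
    return {f(a) for a in domain} == set(codomain)

def inverse(f, domain, codomain):
    """Inverse mapping as a dict; defined only when f is a bijection."""
    if not (is_injective(f, domain) and is_surjective(f, domain, codomain)):
        raise ValueError("f is not a bijection on the given sets")
    return {f(a): a for a in domain}

def double(x):
    return 2 * x

def square(x):
    return x * x

domain = [-2, -1, 0, 1, 2]
print(is_injective(double, domain))        # y = 2x never repeats a value
print(is_injective(square, domain))        # x and -x give the same y
print(inverse(square, [0, 1, 2], [0, 1, 4]))
```

Restricting the domain of $y = x^2$ to non-negative points restores injectivity, which is what allows the inverse to be written down.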


2. The Real Number System 


The real number system $\mathbb{R}$ consists of all the integers as well
as the rational numbers (ratios of integers) and the irrational numbers
(numbers that are not the ratio of integers). The real number system is
actually a set of numbers and some additional structure. The system
includes two operators, $+$ and $\times$, which map from
$\mathbb{R} \times \mathbb{R}$ into $\mathbb{R}$, and a weak ordering
$\ge$ which is a subset of $\mathbb{R} \times \mathbb{R}$. These are the
familiar operators of addition and multiplication, and the ordering is
our old friend “greater than or equal to.” Axiomatically, the system is
characterized by 14 axioms. For our purposes it is sufficient to
highlight only a subset of these conditions. The real number system is a
field, which means that the operations $+$ and $\times$ behave the way
we expect them to: the order of addition (or multiplication) does not
matter, multiplication is distributive (meaning that
$\forall x,y,z \in \mathbb{R}$, $x(y+z) = xy + xz$), addition of 0 and
multiplication by 1 have the expected consequences, and every nonzero
number has a multiplicative inverse (so that $x \times \frac{1}{x} = 1$).
In addition the real number system satisfies order axioms which ensure
that $\ge$ behaves the way we expect it to. What is probably unfamiliar
territory is one particular axiom of the real number system which we
must emphasize.


DEFINITION 12.1. Completeness axiom: For every non-empty subset
$S \subset \mathbb{R}$, if there exists an upper bound $b$ of $S$
(meaning $x \in S \Rightarrow x \le b$) then there exists a least upper
bound $c$ (meaning $c$ is an upper bound of $S$ and if $z$ is an upper
bound of $S$ then $c \le z$). In other words, every non-empty set with
an upper bound has a least upper bound.


An example of a space that is not complete is
$\mathbb{R} \setminus \mathbb{Z}$ where $\mathbb{Z}$ is the set of
integers. In this space the set $(0,1)$ has an upper bound (for example,
$3/2$) but it does not have a least upper bound.


2.1. Limits of Real numbers. By a sequence of real numbers
$\{x_n\}_{n=1}^{\infty}$, sometimes just denoted $\{x_n\}$, we mean an
infinite list of real numbers. More precisely, a sequence is a function
which maps the counting numbers $(1, 2, 3, \ldots)$ into the real
numbers. In this sense $x_n$ is the value of this function evaluated at
integer $n$.

DEFINITION 12.2. The number $l \in \mathbb{R}$ is a limit of the
sequence $\{x_n\}$ if for every $\epsilon > 0$ there is an $N$ such that
for all $n > N$ we have $|x_n - l| < \epsilon$. If $l$ is a limit of the
sequence $\{x_n\}$ we write $l = \lim x_n$.


PROPOSITION 12.1. A sequence has at most one limit. 


Proof: Suppose otherwise; then $a = \lim x_n$, $b = \lim x_n$, and
$a \neq b$. Since $a \neq b$ there exists some $\epsilon > 0$ such that
(1) $|a - b| > 2\epsilon$. Since both $a$ and $b$ are limits, it must be
the case that for some $N$, if $n > N$ then (2) $|x_n - a| < \epsilon$
and (3) $|x_n - b| < \epsilon$. Without loss of generality assume that
$a < b$. If $x_n \le a < b$ then we have contradicted 1 or 3; if
$a < b \le x_n$ then we have contradicted 1 or 2; if $a < x_n < b$ then
1 implies that either 2 or 3 is violated. One of these three cases must
be true.


DEFINITION 12.3. A sequence $\{x_n\}$ is a Cauchy sequence if for
every $\epsilon > 0$ there is an $N$ such that for all $n, m > N$ we
have $|x_n - x_m| < \epsilon$.


PROPOSITION 12.2. A sequence has a limit if and only if it is a
Cauchy sequence.


DEFINITION 12.4. We say a sequence $\{x_n\}$ converges to infinity
$\infty$ ($-\infty$) if for any $b \in \mathbb{R}$ there is some $N$ such
that for all $n > N$ we have $x_n > (<)\ b$.


DEFINITION 12.5. The number $l \in \mathbb{R}$ is a cluster point of the
sequence $\{x_n\}$ if for every $\epsilon > 0$ and every $N$ there exists
some $n > N$ such that $|x_n - l| < \epsilon$.


The sequence $x_n = (-1)^n$ has two cluster points, $1$ and $-1$, but no
limit.

PROPOSITION 12.3. $l$ is a cluster point of $\{x_n\}$ if and only if it
is the limit of a subsequence $\{x_{n_k}\}$.


DEFINITION 12.6. The number $l \in \mathbb{R}$ is the limit superior
(limsup) of the sequence $\{x_n\}$ if (1) for any $\epsilon > 0$ there is
an $N$ such that for all $n > N$ we have $x_n < l + \epsilon$, and (2)
for any $\epsilon > 0$ and any $N$ there exists some $n > N$ such that
$x_n > l - \epsilon$. We write $l = \limsup x_n$.


DEFINITION 12.7. $l$ is the limit inferior (liminf) of the sequence
$\{x_n\}$ if $l = -\limsup(-x_n)$.


An alternative definition which may be more intuitive is, 


DEFINITION 12.8. The limsup is the greatest cluster point and the 
liminf is the least cluster point. 




PROPOSITION 12.4. For any sequence $\{x_n\}$,
$\limsup x_n \ge \liminf x_n$, and if equality holds then $\lim x_n$
exists and $\lim x_n = \limsup x_n = \liminf x_n$.
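

The notions of cluster point, limsup, and liminf can be made concrete with a sequence that has no limit. The Python sketch below uses the illustrative sequence $x_n = (-1)^n(1 + 1/n)$, which is our own choice, and approximates the limsup and liminf by the largest and smallest terms in a long but finite stretch of the tail. A finite computation of this kind can only suggest the limiting values; it does not establish them.

```python
def x(n):
    """Illustrative sequence x_n = (-1)^n (1 + 1/n), with cluster points 1 and -1."""
    return (-1) ** n * (1 + 1 / n)

def tail_sup(seq, start, horizon=1000):
    """Largest term among seq(start), ..., seq(start + horizon - 1)."""
    return max(seq(n) for n in range(start, start + horizon))

def tail_inf(seq, start, horizon=1000):
    """Smallest term among seq(start), ..., seq(start + horizon - 1)."""
    return min(seq(n) for n in range(start, start + horizon))

for start in (10, 100, 10_000):
    print(start, tail_sup(x, start), tail_inf(x, start))
```

As the starting index grows, the tail maximum falls toward 1 and the tail minimum rises toward $-1$; since the two differ, the sequence has no limit.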


3. Points and sets 


We now move beyond the real numbers and consider arbitrary 
spaces that are endowed with particular structures. These spaces are 
called metric spaces. Typically, we think about a notion of distance in 
R and specify that the distance between two points x and y is |x — y|. 
More generally, we can think about arbitrary sets endowed with
a distance function.


DEFINITION 12.9. A metric space $(X,d)$ is a set of points $X$ and a
distance function $d(x,y) : X \times X \to \mathbb{R}$, satisfying the
conditions:

1. $d(x,y) \ge 0$

2. $d(x,y) = 0$ if and only if $x = y$

3. $d(x,y) = d(y,x)$ for any $x, y \in X$

4. $d(x,z) \le d(x,y) + d(y,z)$ for any $x, y, z \in X$.
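

The four conditions of a metric can be tested on a finite sample of points. The Python sketch below is illustrative only: the sample points, the discrete metric, and the squared Euclidean distance (which fails the triangle inequality) are our own examples, and passing the check on a sample does not prove that a function is a metric on the whole space.

```python
import itertools
import math

def is_metric_on(points, d, tol=1e-12):
    """Check the four metric conditions on every pair and triple of sample points."""
    for x, y in itertools.product(points, repeat=2):
        if d(x, y) < 0:                       # non-negativity
            return False
        if (d(x, y) == 0) != (x == y):        # zero distance exactly when x = y
            return False
        if abs(d(x, y) - d(y, x)) > tol:      # symmetry
            return False
    for x, y, z in itertools.product(points, repeat=3):
        if d(x, z) > d(x, y) + d(y, z) + tol:  # triangle inequality
            return False
    return True

def discrete(x, y):
    """Discrete metric: 0 if the points coincide, 1 otherwise."""
    return 0 if x == y else 1

def squared_euclid(x, y):
    """Squared Euclidean distance; not a metric."""
    return math.dist(x, y) ** 2

points = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (2.0, 2.0), (0.5, -1.0)]
print(is_metric_on(points, math.dist))
print(is_metric_on(points, discrete))
print(is_metric_on(points, squared_euclid))
```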


In addition it is convenient to think about balls around points. So
for a scalar $\epsilon > 0$ and a point $x \in X$ we say the
$\epsilon$-ball around $x$ is
$B(x,\epsilon) = \{y \in X : d(x,y) < \epsilon\}$. There are two
properties of sets that we are concerned with.


DEFINITION 12.10. A set $A \subset X$ is open if for every $x \in A$
there is some $\epsilon > 0$ such that $B(x,\epsilon) \subset A$. A set
$A \subset X$ is closed if its complement $X \setminus A$ is open.


Out of convention, we think of X and @ as both open and closed. 
Several results about open and closed sets can be established. 


PROPOSITION 12.5. 1. If $O_1$ and $O_2$ are open then $O_1 \cap O_2$ is
open.

2. Given a collection of open sets $O_1, O_2, \ldots$ the set
$\cup_i O_i$ is open.

3. If $C_1$ and $C_2$ are closed then $C_1 \cup C_2$ is closed.

4. Given a collection of closed sets $C_1, C_2, \ldots$ the set
$\cap_i C_i$ is closed.


PROOF. 1. Assume that $O_1$ and $O_2$ are open and pick an arbitrary
point $x \in O_1 \cap O_2$. Since $O_1$ and $O_2$ are open there are some
$\epsilon_1, \epsilon_2 > 0$ such that $B(x,\epsilon_1) \subset O_1$ and
$B(x,\epsilon_2) \subset O_2$. Letting
$\epsilon = \min\{\epsilon_1, \epsilon_2\}$ we have
$B(x,\epsilon) \subset O_1$ and $B(x,\epsilon) \subset O_2$, implying
that $B(x,\epsilon) \subset O_1 \cap O_2$.

2. Assume that $O_1, O_2, \ldots$ are open and pick an arbitrary point
$x \in \cup_i O_i$. Since $x \in O_i$ for some $i$ there is some
$\epsilon > 0$ such that $B(x,\epsilon) \subset O_i$. But this means that
$B(x,\epsilon) \subset \cup_i O_i$.

3. and 4. follow from De Morgan’s laws
$X \setminus \{A \cup B\} = \{X \setminus A\} \cap \{X \setminus B\}$ and
$X \setminus \{A \cap B\} = \{X \setminus A\} \cup \{X \setminus B\}$. □


It is not the case that infinite intersections of open sets are
necessarily open. An example is the collection of open sets
$(-\frac{1}{n}, \frac{1}{n})$ for $n = 1, 2, \ldots$. Each such set is
open but the intersection is just the set $\{0\}$, which is not open.

Another property of sets that surfaces is 


DEFINITION 12.11. A set $A \subset X$ is bounded if there exists some
finite scalar $k$ such that for every $x, y \in A$ we have $d(x,y) < k$.


If $X$ is a subset of finite dimensional Euclidean space
$\mathbb{R}^n = \{(x_1, x_2, \ldots, x_n) : x_i \in \mathbb{R}\}$ then
we have the following definition.


DEFINITION 12.12. A set $A \subset \mathbb{R}^n$ is compact if $A$ is
closed and bounded.


In arbitrary metric spaces a more general definition of compactness 
is needed. The more general (or Topological) definition of compactness 
deals with open covers. 


DEFINITION 12.13. Given a set $A$, an open covering of $A$ is a
collection of sets $\{O_\theta\}_{\theta \in \Theta}$, where $\Theta$ is
an arbitrary index set and $O_\theta$ is open for every
$\theta \in \Theta$, such that
$A \subset \cup_{\theta \in \Theta} O_\theta$ (in other words, if
$x \in A$ then there is some $\theta \in \Theta$ such that
$x \in O_\theta$).


A set is compact if every open covering has a finite sub covering. 


DEFINITION 12.14. A set $A$ is compact if
$\{O_\theta\}_{\theta \in \Theta}$ being an open covering of $A$ implies
that for some finite set $B \subset \Theta$,
$\{O_\theta\}_{\theta \in B}$ is a covering of $A$.


Given that our metric space is also a vector space (so that addition and
scalar multiplication are defined), which is the case for
$\mathbb{R}^n$, we have the additional useful definition:


DEFINITION 12.15. A set $A$ is convex if for every $x, y \in A$ and
every scalar $\lambda \in [0,1]$ the point $\lambda x + (1-\lambda)y$ is
also in $A$.


PROPOSITION 12.6. If $A_1, A_2, \ldots$ is a collection of convex sets
then $\cap_i A_i$ is a convex set.


PROOF. Since $A_i$ is convex, for any $x, y \in A_i$ and any weight
$\lambda \in [0,1]$ we have $\lambda x + (1-\lambda)y \in A_i$. Since any
$x, y \in \cap_i A_i$ must be in every $A_i$, we know that if
$x, y \in \cap_i A_i$ then $\lambda x + (1-\lambda)y \in A_i$ for every
$i$ and thus $\lambda x + (1-\lambda)y \in \cap_i A_i$. □


4. Continuity of Functions


We now consider two sets $X$ and $Y$ that are each subsets of metric
spaces (not necessarily the same spaces). We refer to the metrics for
these spaces as $d_X$ and $d_Y$. The motivating example is
$X = Y = \mathbb{R}$, but the following applies to any functions that
map one metric space into another. Of central importance in analysis is
continuity.


DEFINITION 12.16. A function $f : X \to Y$ is continuous at $x \in X$
if for any $\epsilon > 0$ there is some $\delta > 0$ such that for all
$y \in X$ with $d_X(x,y) < \delta$ it is the case that
$d_Y(f(x), f(y)) < \epsilon$. A function is continuous if it is
continuous at every point in its domain.


Another way to say the same thing is offered by the following defi- 
nition. 


DEFINITION 12.17. A function $f : X \to Y$ is continuous if for every
open set $B \subset Y$ the inverse image
$f^{-1}(B) := \{x \in X : f(x) \in B\}$ is open.


To clarify, in this definition whether or not a set $B \subset Y$ is
open depends on the metric $d_Y$, and whether or not an inverse image
set $f^{-1}(B)$ is open depends on $d_X$. Of particular interest are
functions for which the range is a subset of the real line. These
functions are called real-valued functions. For real-valued functions
another definition of continuity is often convenient.


DEFINITION 12.18. Given a function $f : X \to \mathbb{R}$, the upper
contour sets are the sets of the form
$U_a = \{x \in X : f(x) \ge a\}$ for every $a \in \mathbb{R}$. The lower
contour sets are the sets $L_a = \{x \in X : f(x) \le a\}$.


Continuity can be restated in terms of the contour sets. 


PROPOSITION 12.7. The function $f : X \to \mathbb{R}$ is continuous if
and only if all of the upper and lower contour sets are closed.


4.1. Extrema, Solutions and Fixed Points*. The following is 
a crucial result indicating sufficient conditions for optimization prob- 
lems to have solutions. 


THEOREM 12.2. If $f : X \to \mathbb{R}$ is continuous and $X$ is compact
and non-empty then there exists a point
$x^* \in \arg\max_{x \in X} f(x)$.


Another crucial result follows. 


THEOREM 12.3. (Bolzano Intermediate Value theorem) If
$f : [a,b] \to \mathbb{R}$ is continuous with $f(a) < y < f(b)$ [or
$f(b) < y < f(a)$] then there is a $c \in (a,b)$ with $f(c) = y$.

PROOF. We treat the case $f(a) < y < f(b)$; the other case is
symmetric. Consider the lower contour set of $y$,
$L_y = \{x \in [a,b] : f(x) \le y\}$. Now $L_y$ is non-empty as
$a \in L_y$. This set is also bounded so by completeness it has a least
upper bound. Call this point $c$. Either $c \in L_y$ (that is
$f(c) \le y$) or $c$ is a cluster point. If $c$ is a cluster point then
there is some sequence $\{x_n\}$ of numbers in $L_y$ with
$\lim x_n = c$. Since $f$ is continuous this implies that $\{f(x_n)\}$
converges to $f(c)$. Since $f(x_n) \le y$ for every $n$ it is the case
that $f(c) \le y$. Thus we know that $c \in L_y$. Now consider the upper
contour set of $y$, $U_y = \{x \in [a,b] : f(x) \ge y\}$. Since
$f(c) \le y < f(b)$ we have $c < b$, and because $c$ is an upper bound
of $L_y$ every $x \in (c,b]$ satisfies $f(x) > y$ and so lies in $U_y$.
Take a sequence $\{z_n\}$ of numbers in $(c,b]$ with $\lim z_n = c$.
Since $f$ is continuous this implies that $\{f(z_n)\}$ converges to
$f(c)$. Since $f(z_n) > y$ for every $n$ it is the case that
$f(c) \ge y$. Thus we know that $c \in U_y$. Thus
$c \in U_y \cap L_y$, implying that $f(c) = y$. Finally, $c \neq a$ and
$c \neq b$ because $f(a) < y < f(b)$. □
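

The intermediate value theorem also underlies a simple numerical procedure, bisection, that locates a point $c$ with $f(c) = y$ up to a tolerance. The Python sketch below is an illustration under the theorem's hypotheses; the function name, the tolerance, and the test function $f(x) = x^3 - x - 1$ on $[1,2]$ are our own choices.

```python
def bisect_level(f, a, b, y, tol=1e-10):
    """Approximate c in (a, b) with f(c) = y, assuming f continuous and f(a) < y < f(b)."""
    if not (f(a) < y < f(b)):
        raise ValueError("need f(a) < y < f(b)")
    lo, hi = a, b              # invariant: f(lo) <= y < f(hi)
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if f(mid) <= y:
            lo = mid           # mid belongs to the lower contour set
        else:
            hi = mid           # mid belongs to the upper contour set
    return (lo + hi) / 2

def f(x):
    return x ** 3 - x - 1

c = bisect_level(f, 1.0, 2.0, 0.0)
print(c, f(c))
```

The lower endpoint always stays in the lower contour set of $y$ and the upper endpoint in the upper contour set, mirroring the roles these sets play in the proof.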


Of central importance to the analysis of games are fixed points. 


DEFINITION 12.19. Given a function $f : X \to X$, a fixed point is a
point $x \in X$ such that $f(x) = x$.


A key result is Brouwer’s fixed point theorem.


THEOREM 12.4. (Brouwer 1910) If $X \subset \mathbb{R}^n$ is compact,
convex and non-empty and $f : X \to X$ is continuous then it has a fixed
point.


While the proof for n > 1 is beyond the scope of this review, one can 
prove the one-dimensional version with the intermediate value theorem. 


PROPOSITION 12.8. If $f : [a,b] \to [a,b]$ is continuous then it has a
fixed point.


PROOF. Define the function $g(x) = f(x) - x$. This is a continuous
function from $[a,b]$ into $\mathbb{R}$. If for some $a', b' \in [a,b]$
we have $g(a') \ge 0$ and $g(b') \le 0$, or $g(a') \le 0$ and
$g(b') \ge 0$, then the intermediate value theorem implies that for some
$c \in [a',b'] \subset [a,b]$ we have $g(c) = 0$, so that $f(c) = c$ and
$c$ is a fixed point. The remaining cases are $g(x) > 0$ for all
$x \in [a,b]$ or $g(x) < 0$ for all $x \in [a,b]$. These cases involve
$f(x) > x$ for all $x$ or $f(x) < x$ for all $x$. But since
$b = \sup\{x \in [a,b]\} \ge \sup\{f(x) : x \in [a,b]\}$ and
$a = \inf\{x \in [a,b]\} \le \inf\{f(x) : x \in [a,b]\}$ this is not
possible. □
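

The one-dimensional argument is constructive enough to compute a fixed point by bisection on $g(x) = f(x) - x$. The Python sketch below is only an illustration; the helper name and the example $f(x) = \cos x$ on $[0,1]$ (which maps $[0,1]$ into itself) are our own choices.

```python
import math

def fixed_point(f, a, b, tol=1e-12):
    """Approximate a fixed point of a continuous f mapping [a, b] into [a, b]."""
    def g(x):
        return f(x) - x
    if g(a) < 0 or g(b) > 0:
        raise ValueError("f does not map [a, b] into itself")
    lo, hi = a, b              # invariant: g(lo) >= 0 >= g(hi)
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if g(mid) >= 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

c = fixed_point(math.cos, 0.0, 1.0)
print(c, math.cos(c))
```

Because $f$ maps $[a,b]$ into itself, $g(a) \ge 0$ and $g(b) \le 0$ always hold, so the bisection can start from the endpoints.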


When $X$ is a subset of a vector space the following condition is
relevant.


DEFINITION 12.20. A function $f : X \to \mathbb{R}$ with $X$ a convex
set is quasi-concave if the upper contour sets are convex. That is, for
every $t \in \mathbb{R}$, every $x, x' \in X$, and every
$\lambda \in (0,1)$ it is the case that $f(x) \ge t$ and $f(x') \ge t$
implies $f(\lambda x + (1-\lambda)x') \ge t$. If the last inequality is
always strict whenever $x \neq x'$, the function is strictly
quasi-concave.


A useful property of quasi-concave objective functions is easily ob- 
tained. 


THEOREM 12.5. If $X$ is convex and $f : X \to \mathbb{R}$ is strictly
quasi-concave then $\arg\max_{x \in X} f(x)$ contains at most one point.


PROOF. By way of contradiction assume otherwise, so that there are two
distinct points $x, y \in \arg\max_{x \in X} f(x)$ and $f(\cdot)$ is
strictly quasi-concave. This means that for $\lambda \in (0,1)$ it is the
case that $f(\lambda x + (1-\lambda)y) > f(x) = f(y)$, contradicting the
fact that $x, y \in \arg\max_{x \in X} f(x)$. □


5. Correspondences* 


Some important concepts about correspondences should also be 
considered. 


DEFINITION 12.21. A correspondence $f : X \rightrightarrows Y$ is
convex-valued if for each $x \in X$ the set $f(x)$ is convex.


Notions of continuity may also be extended to correspondences. 
First we define the upper and lower images. 


DEFINITION 12.22. The upper image of $E \subset Y$ under $f$ (denoted
$f^+(E)$) is defined by $f^+(E) = \{x \in X : f(x) \subset E\}$.


The upper image of a set $E$ is the set of points in $X$ that map into
subsets of $E$.


DEFINITION 12.23. The lower image of $E \subset Y$ under $f$ (denoted
$f^-(E)$) is defined by
$f^-(E) = \{x \in X : f(x) \cap E \neq \emptyset\}$.


The lower image of a set $E$ is the set of points in $X$ that map into
sets that intersect $E$. Just as continuity of functions pertains to
properties of contour sets, continuity of correspondences relates to
properties of these image sets.
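

For a correspondence on a finite domain, the upper and lower images can be computed by enumeration. In the Python sketch below the correspondence is represented, as an assumption of ours, by a dictionary mapping each point of $X$ to a set of points of $Y$; the particular values are arbitrary.

```python
def upper_image(f, E):
    """Points x whose image set f(x) is contained in E."""
    return {x for x, fx in f.items() if fx <= E}

def lower_image(f, E):
    """Points x whose image set f(x) intersects E."""
    return {x for x, fx in f.items() if fx & E}

# A hypothetical correspondence from X = {"a", "b", "c"} into Y = {1, 2, 3, 4}.
f = {"a": {1, 2}, "b": {2, 3}, "c": {4}}
E = {1, 2, 4}

print(sorted(upper_image(f, E)))
print(sorted(lower_image(f, E)))
```

When every $f(x)$ is non-empty, as here, the upper image of $E$ is always contained in its lower image.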


DEFINITION 12.24. A correspondence f : X -— Y is upper hemi- 
continuous if for each x € X, whenever x € ft(E) for E an open set 
in Y there exists an open ball B(x,¢) with B(x,e) C fT(E). 


DEFINITION 12.25. A correspondence f :X —— Y is lower hemi- 
continuous if for each x € X, whenever x € f~(FE) for E an open set 
in Y there exists an open ball B(x,¢) with B(x,e) C fT(E). 
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DEFINITION 12.26. A correspondence f : X —— Y is-continuous 
if it is both upper and lower hemi-continuous. 


For most problems we care about we can settle for an alternative
condition which is more intuitive than upper hemi-continuity.


DEFINITION 12.27. A correspondence $f : X \rightrightarrows Y$ is closed at
$x \in X$ if $x_n \to x$, $y_n \in f(x_n)$ and $y_n \to y$ imply $y \in f(x)$. If a
correspondence is closed at each point in its domain it is closed.


PROPOSITION 12.9. If $Y$ is compact then $f : X \rightrightarrows Y$ is upper
hemi-continuous if and only if it is closed.


The following result is an early version of what is called the “Theorem
of the maximum.” Alternative versions exist, but the basic point
for formal theory is clear: the solutions of well-behaved optimization
problems respond continuously to changes in parameters.


THEOREM 12.6. (Berge 1997) If $u : X \to \mathbb{R}$ is a continuous function
and $\Gamma : Y \rightrightarrows X$ is a continuous, compact-valued correspondence such that for each $y \in Y$, $\Gamma(y) \neq \emptyset$ then

(1) the function $v : Y \to \mathbb{R}$ defined by $v(y) = \max\{u(x) \text{ such that } x \in \Gamma(y)\}$ is continuous and

(2) the correspondence $a : Y \rightrightarrows X$ defined by $a(y) = \arg\max_{x \in \Gamma(y)}\{u(x)\}$
is upper hemi-continuous.


Our final result is a generalization of Brouwer's fixed point theorem.


THEOREM 12.7. (Kakutani 1941) Let $A \subset \mathbb{R}^n$ be compact and convex
and let $f : A \rightrightarrows A$ be closed (or upper hemi-continuous) with
non-empty and convex values then $f$ has a fixed point.


6. Calculus 


The results from analysis presented above represent insights about problems that
can be gained based only on knowledge of topological features (compactness,
continuity) and convexity features. With more structure and
the use of calculus, finer conclusions can be reached through analysis.
In this section we provide a quick review of basic concepts of calculus
which will prove useful throughout the book. Readers are also referred
to Gill (2004), Chiang (2004), and Simon and Blume (1994).


6.1. Calculus in $\mathbb{R}^1$. Many of the questions we ask in empirical
political science involve what happens to variable $y$ when we change
variable $x$. If the variables are related by a function so that $y = f(x)$,
the derivative allows us to describe and quantify the effects the
variables have on one another. Suppose that $y = f(x)$. What happens
if we increase $x$ to $x + h$? The change in $y$ per unit change in $x$ is then
given by

$$\frac{\Delta y}{\Delta x} = \frac{f(x+h) - f(x)}{h}$$

which is just the slope of the line drawn from $f(x+h)$ to $f(x)$. The
difficulty with this measure is that it depends on $h$, as illustrated by
the two heavy dotted lines corresponding to $h_1$ and $h_2$ in Figure 12.3.
We would prefer a measure that does not depend on h and describes 
the behavior of the function as close to x as possible. Such a measure 


is

$$\frac{dy}{dx} = \lim_{h \to 0} \frac{f(x+h) - f(x)}{h}$$

which is known as the derivative of $f$ with respect to $x$. This is
the slope of the solid heavy line in Figure 12.3. We also may use the notation
$f'(x)$. Note that both the numerator and the denominator of this ratio go to zero,
so the limit can in principle take any value. There
is no guarantee that such a limit exists. If it does exist, we say the
function is differentiable. The limit cannot exist if $f$ is not continuous
at $x$; however, $f$ may be continuous and still not be differentiable. For
example, consider the function $f(x) = |x|$, which is continuous but not
differentiable at $x = 0$. To see this, note that

$$\lim_{h \to 0} \frac{f(0+h) - f(0)}{h} = \lim_{h \to 0} \frac{|h|}{h}.$$

Such a limit does not exist because along sequences with $h < 0$ the ratio converges to
$-1$, while along sequences with $h > 0$ it converges to $1$.
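
The one-sided behavior of the difference quotient can be checked numerically. The following is a minimal sketch for illustration only; the helper name `difference_quotient` and the particular step sizes are assumptions of this example, not part of the text.

```python
def difference_quotient(f, x, h):
    """Slope of the line through (x, f(x)) and (x + h, f(x + h))."""
    return (f(x + h) - f(x)) / h

steps = (0.1, 0.01, 0.001)  # assumed step sizes, chosen only for illustration

# f(x) = |x| at x = 0: the quotient depends on the sign of h, so no limit exists.
right = [difference_quotient(abs, 0.0, h) for h in steps]
left = [difference_quotient(abs, 0.0, -h) for h in steps]

# A differentiable comparison: f(x) = x**2 at x = 1, where f'(1) = 2.
def square(x):
    return x ** 2

smooth = [difference_quotient(square, 1.0, h) for h in (0.001, -0.001)]

print(right, left, smooth)
```

For the absolute value function the quotient stays at 1 for positive steps and at -1 for negative steps, while for the quadratic both one-sided quotients approach the same value.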


Insert Figure 12.3 Here 


Since the derivative is a measure of the rate of change in $y$ given a
change in $x$, we can use it to determine whether a function is
increasing or decreasing. If $f'(x) > 0$, the function is increasing while
if $f'(x) < 0$ the function is decreasing.

6.1.1. Some Special Derivatives. Many ordinary functions have derivatives
with well known forms. We now list those for reference.

(1) Constant: If $f(x) = c$ then $f'(x) = 0$.
(2) Linear: If $f(x) = a_0 + a_1 x$ then $f'(x) = a_1$.
(3) Polynomial: If $f(x) = a x^n$ then $f'(x) = n a x^{n-1}$.
(4) Exponential: If $f(x) = e^{ax}$ then $f'(x) = a e^{ax}$.
(5) Natural logarithm: If $f(x) = a \ln(bx)$ then $f'(x) = a/x$.

6.1.2. Derivatives of Composite Functions. We can take derivatives
of more complicated functions, especially if we can break them down
into simpler component functions, say $f(x)$ and $g(x)$. The following rules help
to compute such derivatives.

(1) The Addition and Subtraction Rule: $\frac{d(f+g)}{dx} = f' + g'$ and $\frac{d(f-g)}{dx} = f' - g'$

(2) The Product Rule: $\frac{d(f \cdot g)}{dx} = f' \cdot g + f \cdot g'$

(3) The Quotient Rule: $\frac{d(f/g)}{dx} = \frac{f' g - f g'}{g^2}$

(4) The Chain Rule: Let $z = g(y)$ and $y = f(x)$ so that $z = g(f(x))$, then $\frac{dz}{dx} = \frac{dz}{dy}\frac{dy}{dx} = f'(x)\, g'(y)$
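
The rules above can be sanity-checked against a numerical derivative. This is a hedged sketch: the test functions $f(x) = x^3$ and $g(x) = e^{2x}$, the evaluation point, and the helper `derivative` are all choices made for this example.

```python
import math

def derivative(fn, x, h=1e-6):
    """Central-difference approximation to fn'(x)."""
    return (fn(x + h) - fn(x - h)) / (2 * h)

# Assumed example functions and their known derivatives.
def f(x): return x ** 3
def f_prime(x): return 3 * x ** 2
def g(x): return math.exp(2 * x)
def g_prime(x): return 2 * math.exp(2 * x)

x0 = 0.5  # arbitrary evaluation point

# Each entry pairs a numerical derivative with the value the rule predicts.
checks = {
    "sum": (derivative(lambda x: f(x) + g(x), x0),
            f_prime(x0) + g_prime(x0)),
    "product": (derivative(lambda x: f(x) * g(x), x0),
                f_prime(x0) * g(x0) + f(x0) * g_prime(x0)),
    "quotient": (derivative(lambda x: f(x) / g(x), x0),
                 (f_prime(x0) * g(x0) - f(x0) * g_prime(x0)) / g(x0) ** 2),
    "chain": (derivative(lambda x: g(f(x)), x0),
              f_prime(x0) * g_prime(f(x0))),
}

for rule, (numeric, predicted) in checks.items():
    print(rule, round(numeric, 6), round(predicted, 6))
```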

6.1.3. Higher Derivatives. Since derivatives of $f(x)$ (when they exist)
are themselves functions of $x$, we can take derivatives of derivatives
to learn more about the properties of the function. We represent the
derivative of $f'(x)$, or the second derivative of $f(x)$, as $\frac{d^2 f}{dx^2} = f''(x)$.
As before, if $f'' > 0$, $f'$ is increasing and if $f'' < 0$, $f'$ is decreasing.
The second derivative can also tell us about the behavior of the original
function. If


• $f' > 0$, $f'' > 0$, then $f(x)$ is increasing at an increasing rate
• $f' > 0$, $f'' < 0$, then $f(x)$ is increasing at a decreasing rate
• $f' < 0$, $f'' > 0$, then $f(x)$ is decreasing at a decreasing rate
• $f' < 0$, $f'' < 0$, then $f(x)$ is decreasing at an increasing rate


Figure 12.4 plots a function that exhibits each of these properties 
at different ranges of x. 

In principle, we can take $n$th order derivatives, provided that they
exist. We denote these as $\frac{d^n f}{dx^n} = f^{(n)}(x)$.


Insert Figure 12.4 Here 


6.1.4. Maxima and Minima of Functions. Much of the mathemati- 
cal analysis in political game theory involves maximizing or minimizing 
functions. Voters maximize utility functions and politicians maximize 
votes. States minimize the number of deaths in combat. The derivative
is very handy in locating the local (as opposed to global) extrema
of functions.

Intuitively, the local maximum (minimum) is the point where the 
function ceases to increase (decrease) and begins to decrease (increase). 
Therefore, the derivative must equal zero unless the extremum is
located on the boundary of the domain. Figure 12.5 illustrates
the distinctions between global and local maxima and minima and the 
intuition as to why derivatives must be zero at local extrema. How- 
ever, the derivative may be zero at a point that is not a maximum 
or a minimum as demonstrated by the function in Figure 12.6 which 
contains a “saddle point.” Thus, a second order condition must be 
satisfied to guarantee that a point satisfying the first order condition is 
indeed an extremum. Approaching a local maximum, the derivative is
positive and becomes negative after reaching it. So the derivative must
be decreasing, which means the second derivative cannot be positive. Conversely,
at a local minimum the second derivative cannot be negative.
At a saddle point, the second derivative is zero. 


Insert Figure 12.5 and 12.6 Here 


Some Formal Definitions. The following definitions will be useful.
Let $f : D \to \mathbb{R}$, then


• $f(x^*)$ is a global maximum if $f(x^*) \geq f(x)$ for all $x \in D$

• $f(x^*)$ is a global minimum if $f(x^*) \leq f(x)$ for all $x \in D$

• $f(x^*)$ is a local maximum if there exists $\varepsilon > 0$ such that $f(x^*) \geq f(x)$ for all $x \in D$ with $|x^* - x| < \varepsilon$

• $f(x^*)$ is a local minimum if there exists $\varepsilon > 0$ such that $f(x^*) \leq f(x)$ for all $x \in D$ with $|x^* - x| < \varepsilon$

• If $f(x^*)$ is a maximum, then $x^*$ is known as $\arg\max_{D} f(x)$

• If $f(x^*)$ is a minimum, then $x^*$ is known as $\arg\min_{D} f(x)$

• If $f'(x^*) = 0$, then $x^*$ is a critical point of $f$.

6.1.5. Application: Bureaucratic Resource Allocations. A bureaucrat
has a budget $B$ to spend on two activities that contribute to
the output of the agency. The output of the agency is given by
$O = \sqrt{x_1 x_2}$ where $x_1$ and $x_2$ are the expenditures on activities 1 and 2
respectively. Since $B = x_1 + x_2$, we can replace $x_2$ with $B - x_1$ so that
$O = \sqrt{x_1(B - x_1)}$. Now we wish to find the expenditure $x_1$ that maximizes
the agency's output. First, we will compute the critical values
$x_1^*$ to look for local maxima. The derivative of the output function is

$$O' = \frac{\frac{1}{2}B - x_1}{\sqrt{x_1(B - x_1)}}.$$

Setting $O'$ to 0 reveals that the only critical value is $x_1^* = \frac{1}{2}B$. To
determine whether this is indeed a maximum, we must compute the
second derivative and evaluate it at $x_1^*$. The second derivative is

$$O'' = -\left(x_1(B - x_1)\right)^{-1/2} - \left(\tfrac{1}{2}B - x_1\right)^2 \left(x_1(B - x_1)\right)^{-3/2}.$$

If we evaluate this second derivative at $x_1^* = \frac{1}{2}B$, it reduces to $O'' = -\frac{2}{B} < 0$. Thus,
$x_1^* = \frac{1}{2}B$ is a local maximum and produces an output of $\frac{1}{2}B$. It is easy
to see that it is also a global maximum since $O(x_1^*) = \frac{1}{2}B$ is greater
than $O(0) = O(B) = 0$.
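
As a rough check on this application, the sketch below searches a grid of expenditures and estimates the curvature at the critical value. The budget $B = 10$, the grid resolution, and the helper names are assumptions made purely for illustration.

```python
import math

def output(x1, B):
    """Agency output O = sqrt(x1 * (B - x1)) when the whole budget B is spent."""
    return math.sqrt(x1 * (B - x1))

def second_derivative(fn, x, h=1e-4):
    """Central-difference approximation to fn''(x)."""
    return (fn(x + h) - 2 * fn(x) + fn(x - h)) / h ** 2

B = 10.0  # assumed budget, for illustration only

# Brute-force search over expenditures x1 in [0, B].
grid = [B * i / 1000 for i in range(1001)]
best_x1 = max(grid, key=lambda x1: output(x1, B))

# Curvature at the critical value x1 = B / 2; the text's formula gives -2 / B.
curvature = second_derivative(lambda x1: output(x1, B), B / 2)

print(best_x1, output(best_x1, B), curvature)
```

With these assumed numbers the search selects an even split of the budget, and the estimated second derivative is negative, consistent with a maximum.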

6.1.6. Concavity and Convexity of Functions. Two important properties
of functions are concavity and convexity. To illustrate these
concepts, consider Figure 12.7. A function that curves downward like
$f_1$ is known as concave. We can verify it is concave if for any points
like $x_1$ and $x_2$ the line between $f(x_1)$ and $f(x_2)$ lies below the function
between those two points. Formally, $f : D \to \mathbb{R}$ is concave over the
set $D$ if and only if $f(\lambda x_1 + (1-\lambda)x_2) \geq \lambda f(x_1) + (1-\lambda) f(x_2)$ for all
$\lambda \in [0,1]$ and $x_1, x_2 \in D$.

Alternatively, a function that curves upward like $f_2$ is convex. We
can verify it is convex if for any points like $x_1$ and $x_2$ the line between
$f(x_1)$ and $f(x_2)$ lies above the function between those two points. Formally,
$f : D \to \mathbb{R}$ is convex over the set $D$ if and only if
$f(\lambda x_1 + (1-\lambda)x_2) \leq \lambda f(x_1) + (1-\lambda) f(x_2)$ for all $\lambda \in [0,1]$ and $x_1, x_2 \in D$. We
can extend the definition to the case of strict concavity and convexity
by replacing the weak inequalities with strict ones.

Concave and convex functions are critical because of the following: 


• If $f : D \to \mathbb{R}$ is concave and $f'(x^*) = 0$, $x^*$ is a global maximum.
• If $f : D \to \mathbb{R}$ is convex and $f'(x^*) = 0$, $x^*$ is a global minimum.


These statements are true because if $f''$ exists concavity implies
$f'' \leq 0$ and convexity implies that $f'' \geq 0$.


Insert Figure 12.7 Here 


6.1.7. Integral Calculus. Let $F(x)$ be a function such that $F'(x) = f(x)$.
Then we say that $F$ is the anti-derivative of $f(x)$. We typically
write anti-derivatives in terms of the indefinite integral:

$$F(x) = \int f(x)\,dx$$

The laws of differentiation lead to the following results (where $C$ is an
arbitrary constant). Check by differentiating the right side of each.


• $\int a f(x)\,dx = a \int f(x)\,dx$
• $\int (f + g)\,dx = \int f\,dx + \int g\,dx$
• $\int x^n\,dx = \frac{x^{n+1}}{n+1} + C$ for $n \neq -1$
• $\int \frac{1}{x}\,dx = \ln x + C$
• $\int e^x\,dx = e^x + C$
• $\int e^{f(x)} f'(x)\,dx = e^{f(x)} + C$
• $\int (f(x))^n f'(x)\,dx = \frac{(f(x))^{n+1}}{n+1} + C$ for $n \neq -1$
• $\int \frac{f'(x)}{f(x)}\,dx = \ln f(x) + C$


The most common use of the integral will be to measure the area
under a function. If $F$ is the antiderivative of $f$, then the area underneath
$f$ between points $a$ and $b$ is given by the definite integral

$$\int_a^b f(x)\,dx = F(b) - F(a)$$


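Differentiation of the Definite Integral. The rules for differentiating
definite integrals are:


• $\frac{d}{da}\int_a^b f(x)\,dx = -f(a)$
• $\frac{d}{d\alpha}\int_{a(\alpha)}^{b(\alpha)} f(x,\alpha)\,dx = \int_{a(\alpha)}^{b(\alpha)} \frac{\partial f(x,\alpha)}{\partial \alpha}\,dx + f(b(\alpha),\alpha)\frac{db}{d\alpha} - f(a(\alpha),\alpha)\frac{da}{d\alpha}$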
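
The link between anti-derivatives and areas in Section 6.1.7 can be illustrated numerically. The sketch below is an assumption-laden example rather than anything from the text: it uses a simple trapezoid rule (the helper `trapezoid`), two integrands chosen for convenience, and a finite-difference check that raising the lower limit $a$ changes the integral at rate $-f(a)$.

```python
import math

def trapezoid(f, a, b, n=10_000):
    """Approximate the definite integral of f on [a, b] with n trapezoids."""
    h = (b - a) / n
    interior = sum(f(a + i * h) for i in range(1, n))
    return h * (0.5 * f(a) + interior + 0.5 * f(b))

def square(x):
    return x ** 2

# Area under f(x) = x^2 between 1 and 2, using F(x) = x^3 / 3.
numeric_area = trapezoid(square, 1.0, 2.0)
exact_area = 2.0 ** 3 / 3 - 1.0 ** 3 / 3        # F(b) - F(a)

# Area under f(x) = 1/x between 1 and e, using F(x) = ln x.
numeric_log = trapezoid(lambda x: 1 / x, 1.0, math.e)
exact_log = math.log(math.e) - math.log(1.0)     # F(b) - F(a)

# Derivative of the integral with respect to its lower limit at a = 1.
step = 1e-4
slope_in_a = (trapezoid(square, 1.0 + step, 2.0)
              - trapezoid(square, 1.0 - step, 2.0)) / (2 * step)

print(numeric_area, exact_area, numeric_log, exact_log, slope_in_a)
```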


6.2. Calculus of Several Variables. Consider the function $y = f(\mathbf{x})$.
It is often useful to know how $y$ changes given a change in one
of the elements of $\mathbf{x}$. Typically, we will look at the partial effects of
$x_i$: that is, how a change in $x_i$ affects $y$ while assuming that the other
elements of $\mathbf{x}$ are held fixed. This is equivalent to examining the
behavior of the function within a given “slice”. Formally, the partial
derivative is

$$\frac{\partial f}{\partial x_i} = \lim_{h \to 0} \frac{f(\mathbf{x} + \mathbf{h}_i) - f(\mathbf{x})}{h}$$

where $\mathbf{h}_i$ is a vector of zeros except for an $h$ in the $i$th position. Partial
derivatives are as easy to take as regular derivatives by virtue of the
fact that we can treat all of the other variables as constants.

EXAMPLE 12.6. Let $f(x_1, x_2) = \frac{x_1}{x_2}$. Then $\frac{\partial f}{\partial x_1} = \frac{1}{x_2}$ and $\frac{\partial f}{\partial x_2} = -\frac{x_1}{x_2^2}$.


We often write the collection of partial derivatives as the gradient
vector

$$D_{\mathbf{x}} f = \left(\frac{\partial f}{\partial x_1}, \frac{\partial f}{\partial x_2}, \ldots, \frac{\partial f}{\partial x_n}\right)$$

The gradient vector evaluated at $\mathbf{x}$ describes the behavior of the function
near $\mathbf{x}$.

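6.2.1. Higher Order and Cross Partial Derivatives. Just as with
functions of a single variable, we can take higher-order partial derivatives
to characterize the behavior of partial derivatives. The second partial
derivative with respect to $x_i$ is written as

$$\frac{\partial}{\partial x_i}\left(\frac{\partial f}{\partial x_i}\right) = \frac{\partial^2 f}{\partial x_i^2}.$$

We can interpret it exactly the same way as in the case of a single variable.
However, in the case of more than a single variable, we may want to
know how derivatives change when other variables change. How does
changing $x_j$ affect the partial derivative with respect to $x_i$? We can
write the cross partial derivative as

$$\frac{\partial}{\partial x_j}\left(\frac{\partial f}{\partial x_i}\right) = \frac{\partial^2 f}{\partial x_j \partial x_i}.$$

EXAMPLE 12.7. Let $f(x_1, x_2) = \frac{x_1}{x_2}$. Then $\frac{\partial^2 f}{\partial x_1^2} = 0$, $\frac{\partial^2 f}{\partial x_2^2} = \frac{2x_1}{x_2^3}$,
$\frac{\partial^2 f}{\partial x_1 \partial x_2} = -\frac{1}{x_2^2}$, and $\frac{\partial^2 f}{\partial x_2 \partial x_1} = -\frac{1}{x_2^2}$.

Note that $\frac{\partial^2 f}{\partial x_1 \partial x_2} = \frac{\partial^2 f}{\partial x_2 \partial x_1}$. This is true generally, whenever the second partial derivatives are continuous, as
$\frac{\partial^2 f}{\partial x_i \partial x_j} = \frac{\partial^2 f}{\partial x_j \partial x_i}$.
Thus, the order of partial differentiation does not matter.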

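Often we will denote the collection of second and cross derivatives
in the form of the Hessian matrix:

$$H = \begin{pmatrix}
\frac{\partial^2 f}{\partial x_1^2} & \frac{\partial^2 f}{\partial x_1 \partial x_2} & \cdots & \frac{\partial^2 f}{\partial x_1 \partial x_n} \\
\frac{\partial^2 f}{\partial x_2 \partial x_1} & \frac{\partial^2 f}{\partial x_2^2} & \cdots & \frac{\partial^2 f}{\partial x_2 \partial x_n} \\
\vdots & \vdots & \ddots & \vdots \\
\frac{\partial^2 f}{\partial x_n \partial x_1} & \frac{\partial^2 f}{\partial x_n \partial x_2} & \cdots & \frac{\partial^2 f}{\partial x_n^2}
\end{pmatrix}$$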
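
The partial derivatives of $f(x_1, x_2) = x_1/x_2$ from Examples 12.6 and 12.7 can be approximated by finite differences. This is only a sketch: the helper `partial`, the step size, and the evaluation point $(3, 2)$ are assumptions introduced for the example.

```python
def f(x1, x2):
    return x1 / x2

def partial(fn, i, h=1e-4):
    """Return a function approximating the partial derivative of fn
    with respect to its i-th argument (central difference)."""
    def dfn(*x):
        up, down = list(x), list(x)
        up[i] += h
        down[i] -= h
        return (fn(*up) - fn(*down)) / (2 * h)
    return dfn

f1, f2 = partial(f, 0), partial(f, 1)      # entries of the gradient
f11, f22 = partial(f1, 0), partial(f2, 1)  # own second partials
f12 = partial(f1, 1)                       # differentiate df/dx1 w.r.t. x2
f21 = partial(f2, 0)                       # differentiate df/dx2 w.r.t. x1

point = (3.0, 2.0)  # assumed evaluation point
gradient = (f1(*point), f2(*point))
hessian = [[f11(*point), f12(*point)],
           [f21(*point), f22(*point)]]

print(gradient)
print(hessian)
```

At the assumed point the analytic values are $1/x_2 = 0.5$ and $-x_1/x_2^2 = -0.75$ for the gradient, and the two cross partials both equal $-1/x_2^2 = -0.25$.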


6.2.2. Implicit Function Theorem*. Many equilibrium characterizations
involve finding a value of $\mathbf{x} \in \mathbb{R}^n$ that solves a system like

$$f(\mathbf{x}, \mathbf{y}) = 0$$

for a particular value of the parameters $\mathbf{y} \in \mathbb{R}^k$. When a closed form
solution $\mathbf{x}^*$ exists we get an explicit relationship of the
form

$$\mathbf{x}^* = g(\mathbf{y}).$$

If $g$ is a differentiable function then comparative statics analysis (finding
out how changes in $\mathbf{y}$ affect $\mathbf{x}$) is straightforward. Sometimes,
however, we can prove that a solution $\mathbf{x}^*$ exists for each $\mathbf{y}$ but we cannot
directly solve for the function $g(\cdot)$. For example, a fixed point
theorem may tell us that a solution to the system

$$f(\mathbf{x}, \mathbf{y}) - \mathbf{x} = 0$$

exists, but we may not be able to analytically solve for the vector $\mathbf{x}$.
Under suitable conditions the implicit function theorem lets us implicitly
characterize the derivative $D_{\mathbf{y}}\mathbf{x}^*$. We first present the result in
the case of one endogenous and one exogenous variable. Let $x^*$ solve
$f(x, y) = 0$.

PROPOSITION 12.10. (Implicit Function Theorem) Let $x^*$ solve the
system at $y^*$. If $f(\cdot,\cdot)$ is continuously differentiable and $\frac{\partial f(x^*, y^*)}{\partial x} \neq 0$
then for some open set $A$ containing $x^*$ and an open set $B$ containing
$y^*$ there exists a continuously differentiable function $\phi : B \to A$ with
$f(\phi(y), y) = 0$ and the derivative of this function at $y^*$ is given by

$$\frac{\partial \phi(y^*)}{\partial y} = -\frac{\partial f(x^*, y^*)/\partial y}{\partial f(x^*, y^*)/\partial x}.$$
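
To see Proposition 12.10 at work, consider a hypothetical equation with no convenient closed-form solution, $f(x, y) = x^5 + x - y = 0$. This equation, the bisection solver `solve_x`, and the point $y^* = 2$ are assumptions introduced only for this sketch; they do not come from the text.

```python
def f(x, y):
    """Hypothetical system f(x, y) = x**5 + x - y; strictly increasing in x."""
    return x ** 5 + x - y

def solve_x(y, lo=-10.0, hi=10.0, tol=1e-13):
    """Bisection for the unique root x*(y) of f(x, y) = 0 on [lo, hi]."""
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if f(mid, y) > 0:
            hi = mid
        else:
            lo = mid
    return (lo + hi) / 2

y_star = 2.0
x_star = solve_x(y_star)

# Implicit function theorem: dx*/dy = -(df/dy) / (df/dx) at (x*, y*).
df_dx = 5 * x_star ** 4 + 1
df_dy = -1.0
ift_slope = -df_dy / df_dx

# Comparative statics by brute force: re-solve at nearby parameter values.
h = 1e-5
numeric_slope = (solve_x(y_star + h) - solve_x(y_star - h)) / (2 * h)

print(x_star, ift_slope, numeric_slope)
```

The two slopes agree closely, even though the function $\phi$ itself is never written down explicitly.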


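To present the result in the more general case, we consider endogenous
vectors of the form $\mathbf{x} = (x_1, \ldots, x_n) \in \mathbb{R}^n$ and exogenous vectors of
the form $\mathbf{y} = (y_1, \ldots, y_k) \in \mathbb{R}^k$. Suppose the system $f(\mathbf{x}, \mathbf{y}) = 0$ is of
the form

$$\begin{aligned}
f_1(x_1, \ldots, x_n, y_1, \ldots, y_k) &= 0 \\
&\;\;\vdots \\
f_n(x_1, \ldots, x_n, y_1, \ldots, y_k) &= 0.
\end{aligned}$$

The Jacobian matrix of this system with respect to the endogenous
variables is then the $n$ by $n$ matrix that stacks the transposes of the
gradient vectors:

$$J = \begin{pmatrix} D_{\mathbf{x}} f_1 \\ \vdots \\ D_{\mathbf{x}} f_n \end{pmatrix}$$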
PROPOSITION 12.11. (Implicit Function Theorem) Given a pair
$(\mathbf{x}^*, \mathbf{y}^*)$ for which $\mathbf{x}^*$ is a solution to the system at $\mathbf{y}^*$, if it is the
case that $f_1(\cdot)$ through $f_n(\cdot)$ are continuously differentiable in each coordinate
of $\mathbf{x}$ and $\mathbf{y}$ and the Jacobian matrix of the system with respect
to the endogenous variables is non-singular (i.e. the determinant of
$J$ is non-zero), then for some open set $A$ containing $\mathbf{x}^*$ and an open
set $B$ containing $\mathbf{y}^*$ there exists a continuously differentiable function
$\phi : B \to A$ with $f(\phi(\mathbf{y}), \mathbf{y}) = 0$ and the derivative of this function at
$\mathbf{y}^*$ is given by

$$D_{\mathbf{y}} \phi(\mathbf{y}^*) = -\left[D_{\mathbf{x}} f(\mathbf{x}^*, \mathbf{y}^*)\right]^{-1} D_{\mathbf{y}} f(\mathbf{x}^*, \mathbf{y}^*).$$

6.2.3. Optimization in $\mathbb{R}^n$. Recall that if we want to maximize $f : \mathbb{R} \to \mathbb{R}$,
we need to look for values of $x$ for which $f'(x^*) = 0$.¹ If this
condition did not hold, some other $x$ in a neighborhood of $x^*$ would
produce a larger value of $f(x)$.

The same logic holds for optimizing multivariate functions. However,
now we need the derivative with respect to each element of $\mathbf{x}$ to be
zero. Suppose this were not true and that $\frac{\partial f}{\partial x_i} > 0$. Then, the value
of the function would increase for a small increase in $x_i$ and decrease
for a small decrease. Similarly, we cannot have $\frac{\partial f}{\partial x_i} < 0$ at an interior
optimum.

Given this discussion, it is clear that a necessary condition for $\mathbf{x}^*$
to optimize $f : \mathbb{R}^n \to \mathbb{R}$ is that

$$Df(\mathbf{x}^*) = 0$$

The second order conditions for maxima and minima are based on
the Hessian matrix defined in the last section and require some more
advanced concepts in matrix algebra.² However, we can state the
conditions for $\mathbb{R}^2$.


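DEFINITION 12.28. $\mathbf{x}^* \in \mathbb{R}^2$ is a local maximum if $Df(\mathbf{x}^*) = 0$,
$\frac{\partial^2 f}{\partial x_1^2} < 0$, $\frac{\partial^2 f}{\partial x_2^2} < 0$, and $\frac{\partial^2 f}{\partial x_1^2}\frac{\partial^2 f}{\partial x_2^2} > \left(\frac{\partial^2 f}{\partial x_1 \partial x_2}\right)^2$.


DEFINITION 12.29. $\mathbf{x}^* \in \mathbb{R}^2$ is a local minimum if $Df(\mathbf{x}^*) = 0$,
$\frac{\partial^2 f}{\partial x_1^2} > 0$, $\frac{\partial^2 f}{\partial x_2^2} > 0$, and $\frac{\partial^2 f}{\partial x_1^2}\frac{\partial^2 f}{\partial x_2^2} > \left(\frac{\partial^2 f}{\partial x_1 \partial x_2}\right)^2$.


For a discussion of the general case, we refer the readers to Chiang
(2004) and Simon and Blume (1994).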

Example: Party Resource Allocations. Suppose a political party
wants to allocate its funds across two elections. The party values each
of these seats by $W_1$ and $W_2$ respectively (the party gets zero for each
seat it loses). The probability that the party wins seat $i$ is $\frac{x_i}{1 + x_i}$
where $x_i$ is the amount of money it spends in election $i$. The cost of
spending $x_i$ is simply $x_i$. Therefore, the party wishes to choose $(x_1, x_2)$
to maximize:

$$\frac{x_1}{1 + x_1} W_1 + \frac{x_2}{1 + x_2} W_2 - x_1 - x_2$$

The first order conditions are

$$\frac{W_1}{(1 + x_1)^2} - 1 = 0$$
$$\frac{W_2}{(1 + x_2)^2} - 1 = 0$$

while the Hessian is

$$\begin{pmatrix} -\frac{2W_1}{(1 + x_1)^3} & 0 \\ 0 & -\frac{2W_2}{(1 + x_2)^3} \end{pmatrix}$$

From the first order conditions, we can see that there are four possible
critical values: $(-\sqrt{W_1} - 1, -\sqrt{W_2} - 1)$, $(\sqrt{W_1} - 1, -\sqrt{W_2} - 1)$,
$(-\sqrt{W_1} - 1, \sqrt{W_2} - 1)$, and $(\sqrt{W_1} - 1, \sqrt{W_2} - 1)$.

Note however that the second order conditions require $-\frac{2W_i}{(1 + x_i)^3} < 0$
or $x_i > -1$. Thus, since $W_i > 0$, the only critical value that satisfies
the second order condition is $(\sqrt{W_1} - 1, \sqrt{W_2} - 1)$.

¹ Since the domain is $\mathbb{R}$, we need not worry about corner solutions.

² The sufficient condition for a maximum (minimum) is that $H$ is negative
(positive) definite. An $n \times n$ matrix $M$ is positive definite if for all nonzero vectors
$v \in \mathbb{R}^n$, $v'Mv > 0$. It is negative definite if $v'Mv < 0$ for all nonzero $v \in \mathbb{R}^n$.
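
A brute-force search gives a rough check on the party's unconstrained problem. The seat values $W_1 = 4$ and $W_2 = 9$, the grid of spending levels, and the function names below are assumptions made for illustration; the closed form $(\sqrt{W_1} - 1, \sqrt{W_2} - 1)$ is the critical value derived from the first order conditions.

```python
import math

def payoff(x1, x2, W1, W2):
    """Expected seat value minus spending, with win probability x / (1 + x)."""
    return W1 * x1 / (1 + x1) + W2 * x2 / (1 + x2) - x1 - x2

def interior_optimum(W1, W2):
    """Critical value satisfying the second order conditions."""
    return math.sqrt(W1) - 1, math.sqrt(W2) - 1

W1, W2 = 4.0, 9.0  # assumed seat values
x1_star, x2_star = interior_optimum(W1, W2)

# Compare against a grid search over spending levels in [0, 5] x [0, 5].
grid = [i / 100 for i in range(501)]
best = max(((a, b) for a in grid for b in grid),
           key=lambda p: payoff(p[0], p[1], W1, W2))

print((x1_star, x2_star), best, payoff(x1_star, x2_star, W1, W2))
```

Note that spending is only non-negative at this critical value when $W_i \geq 1$; for smaller seat values a corner solution at zero spending would need separate treatment.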

6.2.4. Concave and Convex Functions. The definitions of concavity
and convexity that we have already encountered generalize easily to $\mathbb{R}^n$.


DEFINITION 12.30. Let $U \subseteq \mathbb{R}^n$ and $f : U \to \mathbb{R}$. The function
$f$ is concave if for all $\mathbf{x}, \mathbf{y} \in U$ and $\lambda \in [0,1]$, $f(\lambda \mathbf{x} + (1-\lambda)\mathbf{y}) \geq
\lambda f(\mathbf{x}) + (1-\lambda) f(\mathbf{y})$.


DEFINITION 12.31. Let $U \subseteq \mathbb{R}^n$ and $f : U \to \mathbb{R}$. The function
$f$ is convex if for all $\mathbf{x}, \mathbf{y} \in U$ and $\lambda \in [0,1]$, $f(\lambda \mathbf{x} + (1-\lambda)\mathbf{y}) \leq
\lambda f(\mathbf{x}) + (1-\lambda) f(\mathbf{y})$.


Just as before, concavity guarantees that the critical values generate
global maxima while convexity guarantees global minima.


THEOREM 12.8. Let $f : U \to \mathbb{R}$ be a twice differentiable function
where $U$ is an open and convex subset of $\mathbb{R}^n$. If $f$ is a concave function
on $U$ and $Df(\mathbf{x}^*) = 0$ for $\mathbf{x}^* \in U$, then $\mathbf{x}^*$ is a global maximum of $f$
on $U$. If $f$ is a convex function on $U$ and $Df(\mathbf{x}^*) = 0$ for $\mathbf{x}^* \in U$,
then $\mathbf{x}^*$ is a global minimum of $f$ on $U$.


6.2.5. Constrained Maximization.

Equality Constraints. In a number of contexts in political game
theory, it will be useful to solve constrained maximization problems.
Such constraints may arise either because of feasibility constraints on
agents' choices or due to the behavior of other agents. Such problems
will be assumed to take the form of

$$\max f(\mathbf{x}) \text{ subject to } g_1(\mathbf{x}) = 0, \; \ldots, \; g_k(\mathbf{x}) = 0$$

where the function $f : \mathbb{R}^n \to \mathbb{R}^1$ and each of the functions $g_j : \mathbb{R}^n \to \mathbb{R}^1$
are assumed to be twice differentiable. Here we take a cookbook
approach, demonstrating how to analyze problems of this type.

The solution to this constrained optimization problem can be found
by setting up a somewhat different unconstrained optimization and
solving this translated problem. The trick is to incorporate the constraints
as part of the objective function.

The Lagrangian

$$L(\mathbf{x}, \lambda) = f(\mathbf{x}) - \sum_{j=1}^{k} \lambda_j g_j(\mathbf{x})$$

represents this translated objective function. Note that it depends
on both the choice variables $\mathbf{x}$ from our original problem and a new
vector $\lambda$ of $k$ variables. These new variables are called the constraint
multipliers (as each constraint gets its own multiplier). Note that
we started with a real valued objective function and $k$ constraints and
translated the problem into an objective function that is the sum of $k+1$
real valued functions. The first order conditions for optimization of
the Lagrangian are

$$\frac{\partial f(\mathbf{x})}{\partial x_i} - \sum_{j=1}^{k} \lambda_j \frac{\partial g_j(\mathbf{x})}{\partial x_i} = 0 \text{ for each } i = 1, \ldots, n$$

$$g_j(\mathbf{x}) = 0 \text{ for each } j = 1, \ldots, k$$


Analysis of the first n conditions yields necessary conditions on x for 
a solution to the constrained problem. More formally, 


PROPOSITION 12.12. (Lagrangian Theorem) Assume that the gradient
vectors of the $k$ constraint functions are linearly independent vectors.
If $\mathbf{x}^*$ solves the constrained problem then there exists a vector of
Lagrangian multipliers $\lambda \in \mathbb{R}^k$ for which $(\mathbf{x}^*, \lambda)$ solve the above first
order conditions.


The motivation for translating the constrained problem to this unconstrained
problem is best obtained by inspecting the first $n$ first
order conditions of the Lagrangian. They require that any increase
in the value of $f$ obtained by changing $\mathbf{x}$ (from a solution to the first
order conditions) results in a corresponding change in the value of at
least one of the constraint functions $g_j$. In other words, if $\mathbf{x}$ solves the
Lagrangian then any improvement in $f$ would come at the expense of
violating the constraint. Note that the independence requirement for
the procedure to work is that the Jacobian of the constraints with respect
to the variables $\mathbf{x}$ have rank $k$ (that is, the gradient vectors of
the $k$ constraints are independent). Without this constraint qualification
condition it need not be the case that a change in $\sum_{j} \lambda_j \frac{\partial g_j(\mathbf{x})}{\partial x_i}$
corresponds to a violation of the constraint.

Application: Party Resource Allocations. Suppose now we assume
that the party has a budget constraint that it must satisfy when allocating
funds across districts. Now the party wishes to maximize:

$$\frac{x_1}{1 + x_1} W_1 + \frac{x_2}{1 + x_2} W_2 \text{ subject to } B = x_1 + x_2$$

The Lagrangian is $L = \frac{x_1}{1 + x_1} W_1 + \frac{x_2}{1 + x_2} W_2 - \lambda (x_1 + x_2 - B)$ while the first
order conditions are $\frac{W_1}{(1 + x_1)^2} - \lambda = 0$, $\frac{W_2}{(1 + x_2)^2} - \lambda = 0$, and $x_1 + x_2 = B$.
The first two conditions imply that

$$\frac{W_2}{W_1} = \frac{(1 + x_2)^2}{(1 + x_1)^2} \text{ or } \sqrt{\frac{W_2}{W_1}} = \pm\frac{1 + x_2}{1 + x_1}.$$

Together with the budget constraint, we have two equations and two
unknowns. Using the positive roots, we have $\sqrt{\frac{W_2}{W_1}} = \frac{1 + B - x_1}{1 + x_1}$ which
implies that

$$x_1^* = \frac{1 + B - \sqrt{W_2 / W_1}}{1 + \sqrt{W_2 / W_1}} \text{ and } x_2^* = \frac{\sqrt{W_2 / W_1}\,(1 + B) - 1}{1 + \sqrt{W_2 / W_1}}.$$

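Inequality Constraints. The problem considered above requires that
the constraints are of the form $g_j(\mathbf{x}) = 0$. A larger class of optimization
problems require only that a system of inequality or equality
constraints be satisfied. The general problem is then

$$\max f(\mathbf{x}) \text{ subject to } g_j(\mathbf{x}) = 0 \text{ for } j = 1, \ldots, k \text{ and } g_t(\mathbf{x}) \leq 0 \text{ for } t = 1, \ldots, w$$

where the equality constraints are indexed by $j$ and the inequality constraints by $t$.
Again we assume that all of the relevant functions are differentiable.
The Kuhn-Tucker conditions are similar to the Lagrangian conditions
save how the inequality constraints are treated. The relevant translated
first order conditions are

$$\frac{\partial f(\mathbf{x})}{\partial x_i} - \sum_{j=1}^{k} \lambda_j \frac{\partial g_j(\mathbf{x})}{\partial x_i} - \sum_{t=1}^{w} \lambda_t \frac{\partial g_t(\mathbf{x})}{\partial x_i} = 0 \text{ for each } i = 1, \ldots, n$$

$$g_j(\mathbf{x}) = 0 \text{ for each } j = 1, \ldots, k$$

$$\lambda_t g_t(\mathbf{x}) = 0 \text{ for each } t = 1, \ldots, w$$

The difference is that for inequality constraints, either the constraint
binds (in the sense that $g_t(\mathbf{x}) = 0$) or the multiplier $\lambda_t$ is zero.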

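The Envelope Theorem*. In applications the objective function or
the constraints may also depend on exogenous variables $\mathbf{y} = (y_1, \ldots, y_i, \ldots, y_k) \in \mathbb{R}^k$.
Consider the problem

$$\max f(\mathbf{x}; \mathbf{y}) \text{ subject to } g_j(\mathbf{x}; \mathbf{y}) = 0 \text{ for } j = 1, \ldots, k \text{ and } g_t(\mathbf{x}; \mathbf{y}) \leq 0 \text{ for } t = 1, \ldots, w$$

By $v(\mathbf{y})$ we denote the value function which is a mapping $v : \mathbb{R}^k \to \mathbb{R}^1$
with $v(\mathbf{y}) = f(\mathbf{x}^*(\mathbf{y}); \mathbf{y})$ where $\mathbf{x}^*(\mathbf{y})$ is a solution to the optimization
problem above. The theorem of the maximum indicated that under
suitable conditions the value function is continuous. We can use calculus
to gain more insight into the dependence of the value function on
the exogenous parameters.


PROPOSITION 12.13. Assume that $v(\mathbf{y}')$ is differentiable at $\mathbf{y}'$ and
that $(\mathbf{x}^*(\mathbf{y}'), \lambda(\mathbf{y}'))$ solves the above problem and on some open set $A$
containing $\mathbf{x}^*(\mathbf{y}')$ and some open set $B$ containing $\mathbf{y}'$ the set of constraints
that bind on the solution $\mathbf{x}^* : B \to A$ is constant, then for
each $i = 1, \ldots, k$

$$\frac{\partial v(\mathbf{y}')}{\partial y_i} = \frac{\partial f(\mathbf{x}^*; \mathbf{y}')}{\partial y_i} - \sum_{j=1}^{k} \lambda_j \frac{\partial g_j(\mathbf{x}^*; \mathbf{y}')}{\partial y_i} - \sum_{t=1}^{w} \lambda_t \frac{\partial g_t(\mathbf{x}^*; \mathbf{y}')}{\partial y_i}.$$

The novelty of this result is that in characterizing $\frac{\partial v}{\partial y_i}$ we do not
need to worry about $D_{\mathbf{y}} \mathbf{x}^*(\mathbf{y})$.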
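
The envelope result can be illustrated with the budget-constrained party problem, treating the budget $B$ as the exogenous parameter. With the constraint written as $x_1 + x_2 - B = 0$, the proposition implies that the derivative of the value function with respect to $B$ equals the multiplier $\lambda$. The numbers and helper names below are assumptions of this sketch.

```python
import math

def allocation(W1, W2, B):
    """Interior solution of the budget-constrained party problem."""
    r = math.sqrt(W2 / W1)
    x1 = (1 + B - r) / (1 + r)
    return x1, B - x1

def value(W1, W2, B):
    """Value function v(B): the objective evaluated at the optimal allocation."""
    x1, x2 = allocation(W1, W2, B)
    return W1 * x1 / (1 + x1) + W2 * x2 / (1 + x2)

W1, W2, B = 4.0, 9.0, 1.5  # assumed values
x1, x2 = allocation(W1, W2, B)

# Multiplier from the first order condition W1 / (1 + x1)^2 - lambda = 0.
multiplier = W1 / (1 + x1) ** 2

# Direct numerical derivative of the value function with respect to B.
h = 1e-5
dv_dB = (value(W1, W2, B + h) - value(W1, W2, B - h)) / (2 * h)

print(multiplier, dv_dB)
```

The agreement between the two numbers reflects the point of the proposition: the effect of the budget on the optimal allocation itself never has to be computed.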
6.2.6. Multivariate Integrals. We can also calculate the area under
multivariate functions with the use of the multivariate definite integral

$$\int_{a_1}^{b_1} \cdots \int_{a_n}^{b_n} f(x_1, \ldots, x_n)\,dx_1 \cdots dx_n.$$

Multivariate integrals are calculated by sequentially integrating
with respect to one variable while holding the remaining constant.
Suppose that we integrate with respect to $x_1$. Let $F_1(x_1, \ldots, x_n)$ be
the partial anti-derivative with respect to $x_1$, then

$$\int_{a_1}^{b_1} \cdots \int_{a_n}^{b_n} f(x_1, \ldots, x_n)\,dx_1 \cdots dx_n = \int_{a_2}^{b_2} \cdots \int_{a_n}^{b_n} F_1(b_1, x_2, \ldots, x_n)\,dx_2 \cdots dx_n - \int_{a_2}^{b_2} \cdots \int_{a_n}^{b_n} F_1(a_1, x_2, \ldots, x_n)\,dx_2 \cdots dx_n.$$

We can continue this iterative process by taking the partial anti-derivative
of $F_1$ with respect to $x_2$ and so on. It does not matter which definite
partial integral we compute first.


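EXAMPLE 12.8. Consider the double integral of $x^2 y$ with respect to $x$ and $y$.
We begin by computing the partial anti-derivative $F_1(x, y) = \frac{1}{3}x^3 y$.
Evaluating $F_1$ at the upper and lower limits of integration for $x$ leaves a
difference of two single integrals in $y$, each of which can be computed with
the rules of Section 6.1.7.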
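
The iterated procedure can be compared with a direct numerical approximation. The limits of integration used here ($x$ from 1 to 2 and $y$ from 0 to 1) are assumptions chosen for this sketch, as are the midpoint rule and the helper names.

```python
def double_integral(f, ax, bx, ay, by, n=200):
    """Midpoint-rule approximation of the integral of f over [ax, bx] x [ay, by]."""
    hx, hy = (bx - ax) / n, (by - ay) / n
    total = 0.0
    for i in range(n):
        x = ax + (i + 0.5) * hx
        for j in range(n):
            y = ay + (j + 0.5) * hy
            total += f(x, y)
    return total * hx * hy

def integrand(x, y):
    return x ** 2 * y

def F1(x, y):
    """Partial anti-derivative of x^2 * y with respect to x."""
    return x ** 3 * y / 3

ax, bx, ay, by = 1.0, 2.0, 0.0, 1.0  # assumed limits of integration

# Step 1: integrate out x. F1 is linear in y, so the result is coefficient * y.
coefficient = F1(bx, 1.0) - F1(ax, 1.0)
# Step 2: integrate coefficient * y over y, using the anti-derivative y^2 / 2.
iterated = coefficient * (by ** 2 - ay ** 2) / 2

numeric = double_integral(integrand, ax, bx, ay, by)
print(iterated, numeric)
```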


7. Probability Theory 


As we saw in chapter 3, models of decision making under uncer- 
tainty are heavily dependent upon probability theory. In this section, 
we outline the basics of probability and review some of the basic results.


7.1. Outcomes and Events. The building blocks of probability 
theory are outcomes and events. Let $S$ be the set of all possible outcomes
that can be generated by a random process. Such a set is known
as a sample space. A generic element s € S is called an outcome. 


EXAMPLE 12.9. Flipping two coins. S = {HH,HT,TH,TT}. 
EXAMPLE 12.10. Unemployment rates: S = [0, 100] 


The first example is that of a discrete sample space because the 
number of outcomes is finite while the latter is a continuous sample 
space as the number of outcomes is infinite. 

Given a sample space, we define an event as a subset $A \subseteq S$. Thus,
an event is any combination of outcomes.

EXAMPLE 12.11. $A = \{TH, HT\}$ “flip two is different than flip one”


EXAMPLE 12.12. $A = [4, 13]$ “unemployment is between 4 and 13%”


7.2. The Axioms of Probability Theory. Probability theory
concerns itself with the likelihood that various events occur. We let
$\Pr(A)$ denote the probability that event $A$ occurs. Classical probability
theory is based on the following axiomatic statements about
$\Pr(A)$.


AXIOM 12.1. For any event $A$, $\Pr(A) \geq 0$.
AXIOM 12.2. $\Pr(S) = 1$.


AXIOM 12.3. Let $A_1, A_2, \ldots$ be an infinite sequence of disjoint
events then $\Pr\left(\bigcup_{i=1}^{\infty} A_i\right) = \sum_{i=1}^{\infty} \Pr(A_i)$.


Axiom 1 says simply that the probability of any event is non-negative
while axiom 2 says that the probability that some event occurs
is 1. Axiom 3 concerns the probability of mutually exclusive or disjoint
events. It states that the probability of one of an infinite set of mutually
exclusive events is equal to the sum of the probabilities of the individual events. These
axioms lead directly to a number of useful properties of probabilities.

The probability of the null event is zero.


THEOREM 12.9. $\Pr(\emptyset) = 0$.


Axiom 3 extends directly to the case of a finite number of disjoint
events.


THEOREM 12.10. Let $A_1, A_2, \ldots, A_n$ be a finite sequence of disjoint
events then $\Pr\left(\bigcup_{i=1}^{n} A_i\right) = \sum_{i=1}^{n} \Pr(A_i)$.


The previous theorem plus axioms 1 and 2 imply that the probabilities
of an event and its complement sum to one.


THEOREM 12.11. Let $S \setminus A$ be the complement of $A$, then $\Pr(A) +
\Pr(S \setminus A) = 1$.


A direct implication of the previous theorem is that the probability
of any event is no greater than one.


THEOREM 12.12. For any event $A$, $0 \leq \Pr(A) \leq 1$.


If the outcomes associated with event $B$ are a subset of those
associated with event $A$, the probability of event $A$ has to be at least
as large as the probability of $B$.


THEOREM 12.13. If $B \subset A$, then $\Pr(A) \geq \Pr(B)$.




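The next two theorems concern the probability of a union of events.
With just two events $A$ and $B$, we can decompose $A \cup B$ into three
disjoint sets $A \setminus B$, $B \setminus A$, and $A \cap B$. Thus, $\Pr(A \cup B) = \Pr(A \setminus B) +
\Pr(B \setminus A) + \Pr(A \cap B)$ from Theorem 12.10. Theorem 12.10 also suggests
that $\Pr(A) = \Pr(A \setminus B) + \Pr(A \cap B)$ and $\Pr(B) = \Pr(B \setminus A) + \Pr(A \cap B)$.
These results produce Theorem 12.14.


THEOREM 12.14. For any two events $A$ and $B$, $\Pr(A \cup B) =
\Pr(A) + \Pr(B) - \Pr(A \cap B)$.


Theorem 12.15 is a straightforward generalization of Theorem 12.14.


THEOREM 12.15. For any $n$ events $A_1, A_2, \ldots, A_n$,

$$\Pr\left(\bigcup_{i=1}^{n} A_i\right) = \sum_{i=1}^{n} \Pr(A_i) - \sum_{j > i} \Pr(A_i \cap A_j) + \sum_{k > j > i} \Pr(A_i \cap A_j \cap A_k) - \cdots + (-1)^{n+1} \Pr(A_1 \cap A_2 \cap \cdots \cap A_n).$$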
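
For a small discrete sample space these theorems can be verified by enumeration. The sketch below uses the two-coin sample space from Example 12.9 and assumes, purely for illustration, that the four outcomes are equally likely; the event $B$ and the helper `pr` are also assumptions of the example.

```python
from fractions import Fraction

S = {"HH", "HT", "TH", "TT"}  # sample space for flipping two coins

def pr(event):
    """Probability of an event (a subset of S), assuming equally likely outcomes."""
    return Fraction(len(event & S), len(S))

A = {"HT", "TH"}  # the two flips differ
B = {"HH", "HT"}  # hypothetical second event: the first flip is heads

union_direct = pr(A | B)
union_formula = pr(A) + pr(B) - pr(A & B)   # Theorem 12.14
complement_sum = pr(A) + pr(S - A)          # Theorem 12.11

print(union_direct, union_formula, complement_sum)
```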


7.2.1. Dependence and Conditional Probability. We now turn to the
question of how the likelihoods of distinct events are related. We are
concerned with whether the occurrence of one event affects the probability
of another. Consider two events $A$ and $B$. Suppose we know
that event $B$ has occurred. What is the probability that event $A$ will
occur?

One obvious possibility is that the likelihoods of the events are
unrelated. We say that two events $A$ and $B$ are independent if $\Pr(A \cap
B) = \Pr(A)\Pr(B)$. When events are independent, the realization of
one event has no effect on the probability of the other. Suppose that
event $B$ occurs; then the probability that $A$ also occurs, $\Pr(A \cap B)/\Pr(B)$, is simply $\Pr(A)$. Thus, the occurrence
of $A$ is not affected by the occurrence of $B$. This logic extends to a
general definition of independence.


DEFINITION 12.32. Let $A_1, \ldots, A_n$ be a set of events. They are
independent if $\Pr\left(\bigcap_{i=1}^{n} A_i\right) = \prod_{i=1}^{n} \Pr(A_i)$.


For the events to be independent, the same product condition must also hold for every subset of the
events. See DeGroot and Schervish (2001) for an example where 3
events are pairwise independent but not independent.

Now we turn to cases where there is dependency among events.
A key concept for analyzing such relationships is that of conditional
probability. Given two events $A$ and $B$, the conditional probability of
$A$ given $B$ is the probability that $A$ occurs given that $B$ has occurred.
We define the conditional probability of $A$ given event $B$ as:

$$\Pr(A|B) = \frac{\Pr(A \cap B)}{\Pr(B)}$$

assuming $\Pr(B) > 0$.
Note that if $A$ and $B$ are independent, $\Pr(A|B) = \Pr(A)$. It is also
easy to see that conditional probabilities must satisfy $\Pr(A \cap B) =
\Pr(A|B)\Pr(B) = \Pr(B|A)\Pr(A)$. This multiplication rule generalizes
to

$$\Pr\left(\bigcap_{i=1}^{n} A_i\right) = \Pr(A_1) \prod_{i=2}^{n} \Pr\left(A_i \,\Big|\, \bigcap_{j=1}^{i-1} A_j\right).$$


7.3. Bayes’ Theorem. One of the most important uses of proba- 
bility theory in political game theory is its predictions about how agents 
use observed events to make assessments about the probability of unob- 
served events. Bayes’ Theorem specifies exactly how such assessments 
are formed. Before stating and proving this theorem, we need an ad- 
ditional definition. A partition is simply a group of mutually exclusive 
events that cover the entire sample space. 


DEFINITION 12.33. A partition of a sample space $S$ is a set of disjoint
events $A_1, \ldots, A_k$ such that $\bigcup_{i=1}^{k} A_i = S$.


We can now state Bayes' Theorem.
THEOREM 12.16. If $A_1, \ldots, A_k$ form a partition of $S$, $\Pr(B) > 0$, and
$\Pr(A_i) > 0$ for all $i$, then

$$\Pr(A_i|B) = \frac{\Pr(A_i)\Pr(B|A_i)}{\sum_{j=1}^{k} \Pr(A_j)\Pr(B|A_j)}.$$


PROOF. The proof proceeds in a number of steps. 
Claim 1: If $A_1, \ldots, A_k$ is a partition of S and B is a subset of S, the
sets $A_1 \cap B, \ldots, A_k \cap B$ form a partition of B. This follows from the
fact that $\bigcup_{i=1}^{k} (A_i \cap B) = \left(\bigcup_{i=1}^{k} A_i\right) \cap B = S \cap B = B$.

Claim 2: If $A_1, \ldots, A_k$ form a partition of S, $\Pr(B) = \sum_{i=1}^{k} \Pr(A_i \cap B)$.
This is a direct application of Claim 1 and Theorem 12.10.

Claim 3: If $A_1, \ldots, A_k$ form a partition of S and $\Pr(A_i) > 0$ for all
i, then $\Pr(B) = \sum_{i=1}^{k} \Pr(B|A_i)\Pr(A_i)$. This is an application of Claim
2 and the multiplication rule for conditional probabilities.

Now we can prove the main result. Note that if $\Pr(B) > 0$, the
definition of conditional probability implies that

$$\Pr(A_i|B) = \frac{\Pr(A_i \cap B)}{\Pr(B)}.$$

Bayes’ Law follows from the substitution of $\Pr(A_i \cap B) = \Pr(B|A_i)\Pr(A_i)$
for the numerator and $\Pr(B) = \sum_{j=1}^{k} \Pr(B|A_j)\Pr(A_j)$ for the
denominator. □
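
A minimal numerical sketch of Theorem 12.16 follows. The function `posterior` and the prior and likelihood values are hypothetical, chosen only to make the arithmetic easy to follow. The denominator is computed exactly as in Claim 3.

```python
from fractions import Fraction

def posterior(priors, likelihoods):
    """Return Pr(A_i | B) for each i, given Pr(A_i) and Pr(B | A_i)."""
    joint = [p * l for p, l in zip(priors, likelihoods)]  # Pr(A_i) Pr(B | A_i)
    pr_b = sum(joint)                                     # Claim 3: Pr(B)
    if pr_b == 0:
        raise ValueError("Pr(B) must be positive")
    return [j / pr_b for j in joint]

priors = [Fraction(1, 4), Fraction(3, 4)]       # hypothetical Pr(A_1), Pr(A_2)
likelihoods = [Fraction(4, 5), Fraction(1, 5)]  # hypothetical Pr(B | A_i)
post = posterior(priors, likelihoods)
print(post)  # [Fraction(4, 7), Fraction(3, 7)]
```

Although $A_1$ starts with prior probability 1/4, observing B (which is four times as likely under $A_1$) raises its posterior to 4/7.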


7.4. Random Variables and Distributions. It is often conve- 
nient to use numerical representations of outcomes and events. Such 
representations are known as random variables. A random variable is 
simply a function that maps all possible outcomes into real numbers. 


DEFINITION 12.34. Let $X : S \to \mathbb{R}$ for some sample space S. Then
X is a random variable that assigns a real number $X(s)$ to each
possible outcome $s \in S$.


Given this definition of random variables, it is straightforward to 
define events as sets of real numbers and to define probability dis- 
tributions over the random variables. A distribution is simply an 
assignment of probabilities to such events. 


DEFINITION 12.35. Let A be any subset of $\mathbb{R}$ and let $\Pr(X \in A)$
denote the probability that X is in A. Then
$\Pr(X \in A) = \Pr\{s : X(s) \in A\}$. A probability distribution of X is a
specification of $\Pr(X \in A)$ for all $A \subset \mathbb{R}$.


7.4.1. Discrete Distributions. A random variable X has a discrete
distribution if it can take on only a finite number of outcomes:
$x_1, x_2, \ldots, x_k$. We call this set of possible outcomes the support
of the distribution.


DEFINITION 12.36. If a random variable has a discrete distribution, 
the probability function (pf) f of X is defined as f(x) = Pr(X = x)
for any real number x. 


If x is not equal to one of the points in the support of X, then
$f(x) = 0$. By the axioms of probability theory, we know that
$\sum_{i=1}^{k} f(x_i) = 1$ and $0 \le f(x) \le 1$.

EXAMPLE 12.13. The Uniform Distribution over Integers: Suppose
that the value of X is equally likely to be one of k integers $1, 2, 3, \ldots, k$.
Then the pf is

$$f(x) = \begin{cases} \frac{1}{k} & \text{for } x = 1, \ldots, k \\ 0 & \text{otherwise} \end{cases}$$


EXAMPLE 12.14. The Binomial Distribution: Suppose an experiment
succeeds with probability p and fails with probability $1 - p$. The
pf for x successes out of n independent trials is given by:

$$f(x) = \begin{cases} \binom{n}{x} p^x (1-p)^{n-x} & \text{for } x = 0, 1, \ldots, n \\ 0 & \text{otherwise} \end{cases}$$

where $\binom{n}{x} = \frac{n!}{x!(n-x)!}$.
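
The binomial pf can be evaluated directly from this formula. The sketch below is one possible implementation. The function name `binomial_pf` and the values of n and p are assumptions for illustration. It also confirms that the pf sums to one over its support.

```python
from fractions import Fraction
from math import comb

def binomial_pf(x, n, p):
    """f(x) for x successes in n independent trials; 0 off the support."""
    if isinstance(x, int) and 0 <= x <= n:
        return comb(n, x) * p**x * (1 - p) ** (n - x)
    return 0

n, p = 4, Fraction(1, 3)  # hypothetical values
pf = [binomial_pf(x, n, p) for x in range(n + 1)]
print(pf[2], sum(pf))  # 8/27 1
```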


7.4.2. Continuous Distributions. Suppose that X can take on a
continuum of values, such as any point in an interval. Then we say
that X is a continuous random variable. If X is a continuous random
variable, then there exists a non-negative function f such that for
any interval $A = [a, b]$

$$\Pr(X \in A) = \int_a^b f(x)\,dx$$


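The function f is known as the probability density function (or pdf).
It does not tell us $\Pr(X = x)$ (which is 0). Rather, at any point where
f is continuous, $f(x)$ is the limit of
$\Pr(X \in [x - \epsilon, x + \epsilon]) / (2\epsilon)$ as $\epsilon$ goes to 0.
Every pdf must satisfy the following:

$$f(x) \ge 0 \text{ for all } x$$

$$\int_{-\infty}^{\infty} f(x)\,dx = 1$$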


The set X* = {x : f(x) > 0} is known as the support of X. 


EXAMPLE 12.15. The Uniform Distribution on an Interval: Let a
and b be two real numbers with $a < b$. Consider an experiment in which a
point X is chosen from $S = [a, b]$ where the probability that X belongs
to any subinterval is proportional to the length of that subinterval. This
implies that the pdf must take the same value c at every point in S and 0
otherwise. Thus,
$\int_{-\infty}^{\infty} f(x)\,dx = \int_a^b f(x)\,dx = \int_a^b c\,dx = 1$.
Solving the integral for c, we obtain that $c = \frac{1}{b-a}$. Thus, the
pdf for the uniform distribution is

$$f(x) = \begin{cases} \frac{1}{b-a} & \text{for } x \in [a, b] \\ 0 & \text{otherwise} \end{cases}$$
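
The two defining features of this density (it integrates to one, and the probability of a subinterval is proportional to its length) can be checked numerically. The sketch below uses a simple midpoint rule. The helper `integrate` and the endpoints a = 2, b = 5 are assumptions for illustration, and the results are approximations rather than exact values.

```python
def uniform_pdf(x, a, b):
    """Density of the uniform distribution on [a, b]."""
    return 1.0 / (b - a) if a <= x <= b else 0.0

def integrate(f, lo, hi, steps=100_000):
    """Midpoint-rule approximation of the integral of f over [lo, hi]."""
    width = (hi - lo) / steps
    return sum(f(lo + (i + 0.5) * width) for i in range(steps)) * width

a, b = 2.0, 5.0  # hypothetical endpoints
total = integrate(lambda x: uniform_pdf(x, a, b), a - 1, b + 1)
sub = integrate(lambda x: uniform_pdf(x, a, b), 3.0, 4.5)
print(round(total, 6), round(sub, 6))  # 1.0 0.5
```

The subinterval [3, 4.5] has length 1.5 out of a total length of 3, so its probability is one half.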


7.4.3. The Cumulative Distribution Function. The cumulative
distribution function (cdf) is a real-valued function that gives, for any
real number x, the probability that X takes on a value no greater than x:

$$F(x) = \Pr(X \le x)$$

For discrete distributions, $F(x) = \sum_{\{i : x_i \le x\}} f(x_i)$. For continuous
distributions, $F(x) = \int_{-\infty}^{x} f(\xi)\,d\xi$. Note that at any point at
which F is differentiable, we have $F'(x) = f(x)$. Thus, a continuous random
variable can be represented either by the pdf or the cdf.
The following are some important properties of the cdf. 
(1) $\Pr(X > x) = 1 - F(x)$.
(2) $\Pr(x_2 \ge X > x_1) = F(x_2) - F(x_1)$.
(3) F is non-decreasing, i.e., if $x_2 > x_1$, then $F(x_2) \ge F(x_1)$.
(4) $\lim_{x \to -\infty} F(x) = 0$ and $\lim_{x \to \infty} F(x) = 1$.
(5) F is always continuous from the right. It may be discontinu- 
ous from the left at x if « occurs with a positive probability. 
See Figure 12.8. 


Insert Figure 12.8 Here 
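
The properties of the cdf listed above are easy to see for a discrete distribution, where F is a step function. The following sketch assumes a fair six-sided die. The names `pf` and `cdf` are hypothetical.

```python
from fractions import Fraction

# pf of a fair six-sided die (a hypothetical discrete distribution).
pf = {x: Fraction(1, 6) for x in range(1, 7)}

def cdf(x):
    """F(x) = Pr(X <= x): sum the pf over support points no greater than x."""
    return sum(prob for point, prob in pf.items() if point <= x)

# Property (2): Pr(x1 < X <= x2) = F(x2) - F(x1).
pr_3_to_5 = cdf(5) - cdf(2)      # Pr(X in {3, 4, 5})
# F jumps at support points: compare the value just left of 3 with F(3).
jump_at_3 = cdf(3) - cdf(2.999)
print(pr_3_to_5, jump_at_3)  # 1/2 1/6
```

The jump at 3 equals the probability that X = 3, which is why F is discontinuous from the left there but continuous from the right.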


7.4.4. Bivariate Distributions. Sometimes we will be concerned with 
the probabilities of two or more random variables, say X and Y, si- 
multaneously. One useful tool for analyzing two random variables is 
the joint distribution which characterizes the probability of pairs of 
realizations of X and Y. 

Discrete Bivariate Probability Functions. For the case of two dis- 
crete random variables, the bivariate probability function is given by 


$$f(x, y) = \Pr(X = x \text{ and } Y = y).$$


Let $x_1, \ldots, x_k$ and $y_1, \ldots, y_m$ be the supports of X and Y
respectively. Then $f(x, y)$ must satisfy the following properties:
(1) $\sum_{i=1}^{k} \sum_{j=1}^{m} f(x_i, y_j) = 1$.
(2) Let A be any set of pairs drawn from $\{x_1, \ldots, x_k\}$ and
$\{y_1, \ldots, y_m\}$. Then $\Pr\{(X, Y) \in A\} = \sum_{(x_i, y_j) \in A} f(x_i, y_j)$.


Continuous Bivariate Density Functions. If X and Y are continuous 
random variables, the bivariate density is defined by 


$$\Pr\{(X, Y) \in A\} = \iint_A f(x, y)\,dx\,dy$$

for any $A \subset \mathbb{R}^2$. The bivariate pdf must satisfy the following properties:
(1) For any $(x, y) \in \mathbb{R}^2$, $f(x, y) \ge 0$.
(2) $\int_{-\infty}^{\infty} \int_{-\infty}^{\infty} f(x, y)\,dx\,dy = 1$.

Bivariate Distribution Function. We can also generalize the notion
of the cdf to bivariate distributions. The joint distribution function
can be denoted by

$$F(x, y) = \Pr\{X \le x \text{ and } Y \le y\}$$

We can use the joint distribution function to determine the probability
that (X, Y) lies in a rectangle $(a, b] \times (c, d]$:

$$\begin{aligned}
\Pr(a < X \le b \text{ and } c < Y \le d) &= \Pr(a < X \le b \text{ and } Y \le d) - \Pr(a < X \le b \text{ and } Y \le c) \\
&= [\Pr(X \le b \text{ and } Y \le d) - \Pr(X \le a \text{ and } Y \le d)] \\
&\quad - [\Pr(X \le b \text{ and } Y \le c) - \Pr(X \le a \text{ and } Y \le c)] \\
&= F(b, d) - F(a, d) - F(b, c) + F(a, c)
\end{aligned}$$


We can derive the distribution functions of X and Y ($F_x$ and $F_y$) from
the joint distribution function:

$$F_x(x) = \lim_{y \to \infty} F(x, y)$$

$$F_y(y) = \lim_{x \to \infty} F(x, y)$$


7.4.5. Marginal Distributions. Suppose we know the joint pdf of X
and Y. We can get the probability density of each of them individually.
These distributions of the individual random variables are known as
the marginal distributions. For discrete distributions, the marginal
probability functions $f_x$ and $f_y$ are defined by

$$\Pr(X = x) = f_x(x) = \sum_y \Pr(X = x \text{ and } Y = y) = \sum_y f(x, y)$$

$$\Pr(Y = y) = f_y(y) = \sum_x \Pr(X = x \text{ and } Y = y) = \sum_x f(x, y)$$


For continuous random variables, the marginal density functions are

$$f_x(x) = \int_{-\infty}^{\infty} f(x, y)\,dy$$

$$f_y(y) = \int_{-\infty}^{\infty} f(x, y)\,dx$$


7.5. Independent Random Variables. We know that if X and
Y are independent random variables, then $\Pr(X = x \text{ and } Y = y) =
\Pr(X = x)\Pr(Y = y)$. This implies that

$$F(x, y) = F_x(x) F_y(y)$$

where $F_x$ and $F_y$ are the marginal cdfs. It is also true that

$$f(x, y) = f_x(x) f_y(y)$$

where $f_x$ and $f_y$ are marginal probability (density) functions.
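
For a discrete joint pf stored as a table, the marginals are obtained by summing over the other variable, and independence can be tested by comparing each cell with the product of the marginals. The sketch below assumes a small hypothetical joint pf. The helper `marginal` is not from the text.

```python
from fractions import Fraction

# Hypothetical joint pf f(x, y) on supports {0, 1} x {0, 1, 2}.
joint = {
    (0, 0): Fraction(1, 12), (0, 1): Fraction(2, 12), (0, 2): Fraction(1, 12),
    (1, 0): Fraction(2, 12), (1, 1): Fraction(4, 12), (1, 2): Fraction(2, 12),
}

def marginal(joint, axis):
    """Sum the joint pf over the other variable (axis 0 gives f_x, 1 gives f_y)."""
    out = {}
    for pair, prob in joint.items():
        out[pair[axis]] = out.get(pair[axis], 0) + prob
    return out

f_x = marginal(joint, 0)
f_y = marginal(joint, 1)
# Independence: f(x, y) = f_x(x) f_y(y) at every point of the joint support.
independent = all(joint[(x, y)] == f_x[x] * f_y[y] for (x, y) in joint)
print(f_x[0], f_y[1], independent)  # 1/3 1/2 True
```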


7.6. Conditional Distributions. Suppose that X and Y are not
independent. Then we can define conditional distributions of X given
Y and Y given X. The derivation of the conditional distributions
follows directly from the definition of conditional probability given above.
For the discrete case, the conditional probability functions $g_x(x|y)$
and $g_y(y|x)$ are defined as follows.

$$g_x(x|y) = \Pr(X = x | Y = y) = \frac{\Pr(X = x \text{ and } Y = y)}{\Pr(Y = y)} = \frac{f(x, y)}{f_y(y)}$$

$$g_y(y|x) = \Pr(Y = y | X = x) = \frac{\Pr(X = x \text{ and } Y = y)}{\Pr(X = x)} = \frac{f(x, y)}{f_x(x)}$$

For continuous random variables,

$$g_x(x|y) = \frac{f(x, y)}{f_y(y)}$$

$$g_y(y|x) = \frac{f(x, y)}{f_x(x)}$$
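
In the discrete case these formulas amount to renormalizing one row (or column) of the joint table. The sketch below assumes a hypothetical joint pf for two dependent binary variables. The function name `conditional_y_given_x` is illustrative only.

```python
from fractions import Fraction

# Hypothetical joint pf of two dependent binary variables.
joint = {
    (0, 0): Fraction(3, 10), (0, 1): Fraction(1, 10),
    (1, 0): Fraction(2, 10), (1, 1): Fraction(4, 10),
}

def conditional_y_given_x(joint, x):
    """g_y(y | x) = f(x, y) / f_x(x), defined only when f_x(x) > 0."""
    f_x = sum(p for (xi, _), p in joint.items() if xi == x)
    if f_x == 0:
        raise ValueError("conditioning event has probability zero")
    return {y: p / f_x for (xi, y), p in joint.items() if xi == x}

g_given_0 = conditional_y_given_x(joint, 0)
g_given_1 = conditional_y_given_x(joint, 1)
print(g_given_0[1], g_given_1[1])  # 1/4 2/3
```

Because the conditional distribution of Y changes with x, the two variables in this table are not independent.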


7.7. The Expectation of a Random Variable. One of the most
important features of any probability distribution is its expectation or
central tendency. The expectation of a random variable is the average
over all of its realizations, each weighted by its probability. For discrete
distributions, the expectation of X or E(X) is defined as

$$E(X) = \sum_x x f(x).$$

For continuous distributions,

$$E(X) = \int_{-\infty}^{\infty} x f(x)\,dx.$$
Often in this book, we will be interested in expectations of functions
of a random variable. Let $Y = r(X)$; then
$E(Y) = \int_{-\infty}^{\infty} r(x) f(x)\,dx$.

The expectation must satisfy the following properties:
(1) If $Y = a + bX$, then $E(Y) = a + bE(X)$.
(2) If there exists a such that $\Pr(X \ge a) = 1$, then $E(X) \ge a$. If
there exists b such that $\Pr(X \le b) = 1$, then $E(X) \le b$.
(3) If $X_1, \ldots, X_n$ are random variables,
$E(X_1 + \ldots + X_n) = E(X_1) + \ldots + E(X_n)$.
(4) If $X_1, \ldots, X_n$ are INDEPENDENT random variables, then
$E\left(\prod_{i=1}^{n} X_i\right) = \prod_{i=1}^{n} E(X_i)$.
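
Properties (1), (3) and (4) can be verified by direct computation for a simple discrete case. The sketch below assumes two independent fair dice. The helper `expect`, which computes the discrete analogue of $E[r(X)]$, is hypothetical.

```python
from fractions import Fraction
from itertools import product

die = {x: Fraction(1, 6) for x in range(1, 7)}  # pf of a fair die (assumed)

def expect(pf, r=lambda x: x):
    """E[r(X)] = sum over the support of r(x) f(x)."""
    return sum(r(x) * p for x, p in pf.items())

# Joint pf of two independent dice: f(x, y) = f_x(x) f_y(y).
joint = {(x, y): die[x] * die[y] for x, y in product(die, die)}

e_x = expect(die)
e_linear = expect(die, lambda x: 2 + 3 * x)       # property (1)
e_sum = expect(joint, lambda xy: xy[0] + xy[1])   # property (3)
e_prod = expect(joint, lambda xy: xy[0] * xy[1])  # property (4)
print(e_x, e_linear, e_sum, e_prod)  # 7/2 25/2 7 49/4
```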


7.8. The Variance of a Random Variable. Another important 
property of a random variable is the extent to which it deviates from 
its expected value. One such measure is the variance defined as 


$$\mathrm{var}(X) = \sigma_x^2 = E[(X - E(X))^2].$$

The variance function must satisfy a number of properties.

(1) If there exists c such that $\Pr(X = c) = 1$, $\mathrm{var}(X) = 0$.

(2) For any constants a and b, $\mathrm{var}(a + bX) = b^2 \mathrm{var}(X)$.

(3) For any random variable X, $\mathrm{var}(X) = E(X^2) - [E(X)]^2$.

(4) If $X_1, \ldots, X_n$ are INDEPENDENT random variables, then
$\mathrm{var}\left(\sum_{i=1}^{n} X_i\right) = \sum_{i=1}^{n} \mathrm{var}(X_i)$.
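
Properties (2) and (3) can be checked on a small example. The sketch below assumes a hypothetical three-point pf and constants a = 5, b = -3. The helper names are illustrative only.

```python
from fractions import Fraction

pf = {0: Fraction(1, 2), 1: Fraction(1, 3), 4: Fraction(1, 6)}  # hypothetical pf

def expect(pf, r=lambda x: x):
    """E[r(X)] for a discrete pf."""
    return sum(r(x) * p for x, p in pf.items())

def variance(pf):
    """var(X) = E[(X - E(X))^2], computed straight from the definition."""
    mu = expect(pf)
    return expect(pf, lambda x: (x - mu) ** 2)

var_def = variance(pf)
var_short = expect(pf, lambda x: x * x) - expect(pf) ** 2  # property (3)
a, b = 5, -3
shifted = {a + b * x: p for x, p in pf.items()}            # pf of a + bX
print(var_def, var_short, variance(shifted))  # 2 2 18
```

Shifting by a leaves the variance unchanged, while scaling by b multiplies it by $b^2 = 9$.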


7.9. The Median and the Mode. Two other important functions
that help to summarize random variables are the median and the mode.

The Median: Let F be the cdf of X. A point m is a median of X
if and only if $\Pr(X \le m) \ge .5$ and $\Pr(X \ge m) \ge .5$ or (for continuous
distributions) $F(m) = .5$.

The Mode: Let f be the pf or pdf of X. Then a number m is a
mode of X if and only if $m \in \arg\max_x f(x)$.
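
For a discrete distribution both definitions can be applied by checking each support point in turn. The sketch below assumes a hypothetical four-point pf. The functions `is_median` and `modes` are illustrative names.

```python
from fractions import Fraction

# Hypothetical pf with a long right tail.
pf = {1: Fraction(1, 5), 2: Fraction(2, 5), 3: Fraction(1, 5), 7: Fraction(1, 5)}

def is_median(pf, m):
    """Check Pr(X <= m) >= 1/2 and Pr(X >= m) >= 1/2."""
    at_most = sum(p for x, p in pf.items() if x <= m)
    at_least = sum(p for x, p in pf.items() if x >= m)
    half = Fraction(1, 2)
    return at_most >= half and at_least >= half

def modes(pf):
    """All points attaining the maximum of the pf (the arg max set)."""
    top = max(pf.values())
    return [x for x, p in pf.items() if p == top]

medians = [x for x in pf if is_median(pf, x)]
mean = sum(x * p for x, p in pf.items())
print(medians, modes(pf), mean)  # [2] [2] 3
```

Here the median and mode are both 2, while the expectation is pulled up to 3 by the outlying value 7.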


7.10. Covariance and Correlation. Given a joint distribution 
over (X,Y), we are often interested in describing the relationship be- 
tween X and Y. In particular, we would like to know the extent to 
which they move together or covary. To this end, the covariance is 
defined as 


$$\mathrm{cov}(X, Y) = \sigma_{xy} = E[(X - \mu_x)(Y - \mu_y)]$$

where $\mu_x = E(X)$ and $\mu_y = E(Y)$.

If X and Y move “together”, the covariance is the expectation of
a positive function and therefore positive. If X and Y move “against
one another”, the covariance is the expectation of a negative function
and is therefore negative.

The covariance has a scaling problem. Consider the covariance of X
and $Z = aY + b$. It is straightforward to show that $\sigma_{xz} = a\sigma_{xy}$. Thus,
the covariance depends on how variables are scaled. The correlation
adjusts for this problem. Formally, the correlation between X and Y
is

$$\rho_{xy} = \frac{\sigma_{xy}}{\sigma_x \sigma_y}.$$

Covariances and correlation coefficients have to satisfy the following
properties.


(1) For any random variables X and Y with finite variances,
$1 \ge \rho_{xy} \ge -1$.

(2) For any random variables X and Y, $\sigma_{xy} = E(XY) - \mu_x \mu_y$.

(3) For independent random variables X and Y with finite variances,
$\sigma_{xy} = \rho_{xy} = 0$.

(4) For random variable X with a finite variance and $Y = aX + b$,
$\rho_{xy} = 1$ if $a > 0$ and $\rho_{xy} = -1$ if $a < 0$.

(5) For any random variables X and Y with finite variances,
$\mathrm{var}(X + Y) = \sigma_x^2 + \sigma_y^2 + 2\sigma_{xy}$.

(6) If $X_1, \ldots, X_n$ are random variables with finite variances,
$\mathrm{var}\left(\sum_{i=1}^{n} X_i\right) = \sum_{i=1}^{n} \sigma_i^2 + 2 \sum_{i=1}^{n-1} \sum_{j=i+1}^{n} \sigma_{ij}$.

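
The scaling problem and properties (2) and (5) can be illustrated on a small joint distribution. The sketch below assumes a hypothetical joint pf of two positively related binary variables. The helpers `expect` and `cov` and the rescaled variable `Z` are illustrative names.

```python
from fractions import Fraction
from math import isclose, sqrt

# Hypothetical joint pf of (X, Y).
joint = {(0, 0): Fraction(2, 5), (0, 1): Fraction(1, 10),
         (1, 0): Fraction(1, 10), (1, 1): Fraction(2, 5)}

def expect(joint, r):
    """E[r(X, Y)] under the joint pf."""
    return sum(r(x, y) * p for (x, y), p in joint.items())

def cov(joint, u, v):
    """cov(U, V) = E(UV) - E(U)E(V) for U = u(X, Y) and V = v(X, Y)."""
    e_uv = expect(joint, lambda x, y: u(x, y) * v(x, y))
    return e_uv - expect(joint, u) * expect(joint, v)

def X(x, y):
    return x

def Y(x, y):
    return y

def Z(x, y):
    return 3 * y + 2  # rescaled version of Y

var_x, var_y, cov_xy = cov(joint, X, X), cov(joint, Y, Y), cov(joint, X, Y)
cov_xz = cov(joint, X, Z)                             # equals 3 * cov_xy
rho = float(cov_xy) / sqrt(float(var_x) * float(var_y))
var_sum = cov(joint, lambda x, y: x + y, lambda x, y: x + y)
print(cov_xy, cov_xz, var_sum)  # 3/20 9/20 4/5
```

Rescaling Y triples the covariance, whereas the correlation (about 0.6 here) would be unaffected by such a change of units.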
7.11. Conditional Expectation. Often we are interested in com- 
puting expectations of random variables conditioned on the outcomes 
of other random variables. The conditional expectation function is 
defined as: 


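$$E(Y|x) = \sum_y y\, g_y(y|x)$$

for discrete random variables and

$$E(Y|x) = \int_{-\infty}^{\infty} y\, g_y(y|x)\,dy$$

for continuous random variables. The conditional expectation is a function
of X and has a distribution derived from the distribution of X. An
important property of conditional expectations is the Law of Iterated
Expectations.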


E(E(Y|X)) = E(Y) 
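
The Law of Iterated Expectations can be confirmed numerically for a discrete joint distribution: averaging the conditional expectations E(Y|x) with weights $f_x(x)$ recovers E(Y). The sketch below assumes a hypothetical joint pf. The function names are illustrative.

```python
from fractions import Fraction

# Hypothetical joint pf of (X, Y).
joint = {(0, 1): Fraction(1, 4), (0, 3): Fraction(1, 4),
         (1, 1): Fraction(1, 8), (1, 5): Fraction(3, 8)}

def f_x(x):
    """Marginal pf of X."""
    return sum(p for (xi, _), p in joint.items() if xi == x)

def cond_exp_y(x):
    """E(Y | x) = sum over y of y * f(x, y) / f_x(x)."""
    return sum(y * p for (xi, y), p in joint.items() if xi == x) / f_x(x)

support_x = sorted({x for x, _ in joint})
outer = sum(cond_exp_y(x) * f_x(x) for x in support_x)  # E(E(Y|X))
e_y = sum(y * p for (_, y), p in joint.items())         # E(Y)
print(cond_exp_y(0), cond_exp_y(1), outer, e_y)  # 2 4 3 3
```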

Equilibrium Policies from Romer-Rosenthal Game
Figure 10.1: Nash Bargaining Solution
Figure 10.2: Veto Bargaining with Incomplete Information
Figure 10.3: Proposals in Cheap-talk Game
Conditions for Equilibrium Vetoes in the Blame Game Model
Figure 11.1: Policy Outcomes from the Epstein-O’Halloran Model
Figure 11.2: Policy Choice in Huber-McCarty Model